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^ (54) Title: DIMERIC FLUORESCEOT POLYPEPTIDES 
tH 

^ (57) Abstract: The invention relates to proteins or pol^ptides chat comprise intramolecular dimers of flooiescent protein 
^ monomers. Mote specifically, the invention relales t recombinant polypqjtides comprising a monomer of a finorescent polypeptide, 
^ a linlEBT peptide, and a second monomer of that fluorescent polypeptide, where the monomers fbnn an intramolecular dimer. llie 
2 invention also relates to nucleic adds encodii^ Intiamolicular Dimer Hnorescent Proteins (IDFPs) and vectors comprising such 
a niclcic acids. The invention further relates to methods of making IDFPs and methods of using them. IDFPs arc uscfiil in any 
Q application suited for fluorescent proteins and are particulariy useful in ^plications in which more than one fluorescent piotein 
1^ sharing complementary dimerization inteifaces is present in die same mixdire or is expressed in the same cell, because IDFPs do 
^ not form hctcrodimcrs. 
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DIMEmC FLUORESCmr POLYPEP^ 
BACKGROUND OF THE INVENTION 

Fluorescent pix)teins are widely used in the fields of biochemistry, molecular and cell 
biology^ medical diagoostics and drug screening methodologies (CSialfie et aL, 1994, Sdence 
263: 802-805; Tsien, 1998, Ann. Rev. Biochem. 67: 509-544). One property shared by the most 
useful fluorescent proteins is that fliey require no host-encoded co-factors or substrates for 
fluorescence. The proteins therefore retain flieir fluorescmt properties bofli in isolation fiom 
their native organisin, and when e)qxressed in the celb of other organists This property makes 
them particularly well suited for a variety of in vivo and in vitro f^lications. Another major 
advantage of fluorescent protdns for use in biological systems is that they are indeed proteins, 
which permits their synthesis within cells or organisms of interest, avoiding a host of problems 
relating to the attachment of the label to a protein of interest and/or delivery of labeled proteins 
mto a cell. Not only can the proteins be made within the desned cell or organism, but they also 
retain their fluorescent prop^es when expressed as fusions with other proteins of interest, 
which greafly enhances their utility both in vivo and in vitro. 

FluorescCTt proteins have been used as r^rter molecules to study gene expression in 
culture as well as in transgenic animals by insertion of fluorescent protein coding sequences 
downstream of an a|>propriate promoter. They have also bem used to study the subcellular 
localization of protefais by dnect fusion of test proteins to fluorescent proteins, and fluorescent 
proteins have become the reporter of choice for monitoring the infection efficiency of viral 
vectors both in cell culture and in animals. Variants of fluorescent proteins exhibiting spectral 
shifts in response to changes in the cellular environment (e.g., changes in pH, ion flux, or tiie 
redox status of the cell) are also used to monitor such changes (see, for example, Inouye & Tsuji, 
1994, FEES Lett 351: 211-214; Miyawaki et al., 1997, Nature 388: 882-887). 
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P^faaps the most promising role for fluorescent proteins as biochemical markers is tibeir 

application to methods that eT^loit ftoorescence resonance energy transfer (FRET). FRET occurs 

with fhiorophores for which the emission spectrum of one fluorophore overlqis with the 

excitation spectrum of a second fhiorophore. When such fhion>phores are brought into close 

proximity, excitation of tiie ^donoif * fhiorophore results in raussion fiom the '^acceptor''. Pans 

of such fluoiophores are thus use&l for monitoring molecular interactions. Fluorescent proteins 

are useful for the analysis of proteinrprotein molecular interactions in vivo or in vitro if their 

respective fluorescent emissicm and excitation spectra overlq> to allow FRET. Thedonorand 

acceptor fluorescent proteins may be produced as fusions with the proteins one wishes to analyze 

fbrinteiractions. Thesetypesof ^ypUcations of fluorescent proteim are particul^ 

lii gli rtiimi ghpi it analyses, since the readout is direct and independent of subcellular localization. 

The prototypical fluorescent protein is the Aequorea victoria green fiuorescmt protein 
(GFPX which was Hie first green fluorescmt protein cloned (Prasher et al., 1992, Gene 111: 229- 
233). Purified A victoria OFF is a monomeric protein of about 27 kDa that absorbs blue li^t 
with an excitation wavelength maTinmni of 395 nm, with a minor peak at 470 nm, and emits 
greea fhiorescmce with an emission wavelength of about 510 nm and a minor peak near 540 nm 
(Ward et al., 1979, Photochem. Photobiol. Rev. 4: 1-57). Hie polypeptide has several 
drawbacks, including relatively broad excitation and emission spectra, low quantum yield, and 
low expression in cells of higgler eukaryotes. Mutants with improved spectral characteristics and 
higher quantum yield have been identified, and expression in higher eukaryotes has been 
improved by *liumanizing'' the nucleic acid sequences to encode codons optimized for human or 
mammalian expression. 

Additional fluorescent proteins include, but are not limited to those expressed by 
Discosoma sp. md Phialidium gregarum (Ward et al., 1982, Photoch^ Photobiol. 35: 803-808; 
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Levine c* al., 1982, Comp. Biochem. Physiol. 72B:77-85). Also, Vibrio ftscheri strain Yl 

eqxresses a yellow fluorescent protein that requires flavins as a co-fector for its fluarescence 

(Baldwin et al., 1990, Biochemistry 29: 5509-5515). 

Additional cloned fluorescent proteins inchide, for cxamplt, the green fluorescent 
proteins fiom flie sea pansy, Renilla nadlerd (WO/99/49019) and fi»m RNolia reniformis (see 
SBQ ID NO: 1; Figure 1). Each of fliese fluorescent proteins and others are useful for a variety of 
in vivo and in vitro uses. The IL rmiformis GFP (rGFP) clone is particularly inqportant, since 
iGEHP is seen as fiie bendunaik protein among known natturally-occuiiing fluorescent proteins. 
rGFP has 3 to 6-fold higher quantum yield than A. victoria GFP, and flie excitation and mission 
spectra are narrower, making rGFP more suitable for qiplications involving, for example, FRET. 

One major drawback shared by the GFPs fiom A. victoria^ R. mullerei and JL rentformiSj 
as well as by all known variants of diose proteins, is that they are dimeric. Generally, the 
proteins exist as homodimers. However, wh^ more than one form of a given GFP is exiM:essed 
in a single cell or is mixed in vitro, heterodimers can form if the dhnerization mterfeces for the 
different fluorescent proteins are complranentary. Heterodimerization mterf^res with the 
nsefufaiess of fluorescent proteins for several reasons. 

First, heterodimerization is undesirable when fluoroscent proteins are nsed m energy 
transfer-based analyses because heterodimerization raises the background of acceptor 
fluorescmce without a real interaction betwem the proteuis or protein domains of interest 
When FRET is used, for example to monitor protemrprotein interactions, donor and acceptor 
fluorescent fusion proteins aroofieneTqpressed in &6sanieceU or othenivisen^ Ihthe 
absence of hetmxlim^rization, the excitation of the donor fiuorophore leads to emission by the 
acceptor fiuorophore only if the two fusion proteins are in close apposition. However, if 
heterodunerizadon occurs between the differing fluorescent proteins (e.g., between a wild-type 
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rGFP and an rOFP variant tfiat is a fluorescence donor to the wild-type GFP), excitation of flie 

donor will result in mission by ttie acc^tor regardless of the intmtction between the fused 

polypq)tides being examined far interaction. Tliis generates an unaccqrtably high bacl;;ground 

fluorescence fiom flie acceptor fhiorophore. 

Another problem caused by the hetmdimerization is that flie dimerization inter&ces 
between the proteins can serve to arti&ctually bring fusion polypeptides linked to flie flucniescent 
protein monomra into close contact The in^ipropriate recruitment of proteins into close 
supposition can have biological come(piences that make da^ For 
example, some cell sur&ce receptors gain die ability to initiate an intracellular signaling cascade 
following ligand-induced dimerization. If the dimerization mterfaces of the flaorescent protenis 
insqppropriately recruit the fiised recq>tor monomers into close contact, the signaling cascade can 
be inappropriately initiated in tibe absence of ligand. There is a need in the art for fluorescent 
proteins tiiat do not heterodimerize. 

U.S. Patent No. 5,981,200 (Tsien et al.) teaches donor and acceptor fluorescent proteins 
linked by a peptide linker. The linked donor and acceptor proteins, referred to as '^dem 
fluorescent jproteins," are taught to be useful for assaying enzymes arable of cleaving die linker 
peptide sequence. When linked, the tandem fhiorescent proteins exhibit either no fluoresces 
(e.g.^ when one protein quenches the flucxtescence of the other) or fluorescence characteristic of 
the accqptor. Following cleavage, the fluorescmce emitted is that characteristic of the individual 
fluorescent proteins. Assays using this airangement will not work unless the tandem fluorescent 
proteins are related as donor and acceptor. 

SUMMARY OF THE INVENTION 

The invention encompasses a recombinant fiision polypeptide conq)rising a first 
polyp^tide peptide bonded to a second polypq)tide, wherem the first and second polypqrtides 
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are found in nabue as monomers of a multimeric protein, and herein the recombinant fusion 

polypeptide is fhiorescrat yvhsa exposed to li^t of an excitation wavelength or when interactive 

with an excited donor fluorophore. 

In one embodiment, the first polypeptide and the second polypeptide are peptide bonded 
to each other via a link^ sequence* 

In anotb^ mibodiment, the recombinant fiision polypq)tide fiirtfaer comprises a thiid 
polypeptide peptide bonded to die recombinant fusion polypeptide. The third polypeptide can be 
peptide bonded to the reccmibinant fiision polypeptide either directly or through a peptide linker 
sequence. A recombinant fiision polypq;>tide of fliisanbodiment is refe^ 
as a 'fluorescent polypeptide fiision.*" In a preferred »ibodim^t, the third polypq)tide is fused 
to the anuno terminus of tbe first polyp^tide. In another preened embodiment flie third 
polyp^tide is fused to flie carboxy terminus of the second polyp^tide sequence. 

In an additional preferred embodiment, the third polypeptide is a member of a specific 
binding pair. 

Iq anotber embodiment, one or both of the first and second polypeptides is a monomer of 
one of renifomds GFP, IL muUeri GPP or A. victoria GFP. 

In another embodimmt, both of the first and second polypqitides are a monomer of one 
of il reniformis GFP, R nudleri GFP or A. victoria GFP. 

The invention further encompasses a polynucleotide encoding a recomhmant fusion 
polypeptide conqnising a first polypq)tide peptide bonded to a second polypeptide, wherein the 
first and second polypeptides are found in nature as monomers of a multimeric protein, and 
wherein the recombinant fusion polypeptide is fluorescent when eTqiosed to lig^ of an excitation 
wavelength or when interactive with an excited donor fluorophore. 
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la one embodiment, the first polypeptide and flie second polypq)tide encoded by the 

polynucleotide aiepq[)tide bonded to eac^ other via a Inapreferred 

embodiment, the linker sequence encoded by Ihe polynucleotide is fiom S to 50 amino adds 

long, hi a further preferred embodiment, the linker sequence comprises one or more iterations of 

a peptide, for example the peptide RARDPRVPVAT (i.a, Arg-Ala-Aig-Asp-Pro-Arg-Val-Pro- 

Val-Ala-Thr). In a fiirther prefened embodimrat, flie linker sequence is selected fmm the group 

consisting of (Arg-Ala-Arg-Asp-I^Arg-Val-Pro-Val-Ak-Thr)^ (Gly-Ser)^, Crhr-Ser-Pro)n, 

(Gly-Gly-Gly)n, and (Ghi-Lys)n, wherem n is 1 to IS. 

In another embodiment, the polynucleotide further encodes a third polypeptide pqrtide 
bonded to flie iecombinantfiisionpoIypq>tide. The third polyp^tide encoded by the 
polynucleotide may be joined directly or via an encoded peptide linker. 

In a preferred embodiment, the third polyp^tide encoded by the polynucleotide is a 
meaziber of a specific bindmgpau-. It alternatively piefmed that the ttubrdCTcodedpoly^ 
fused to the amino terminus of the first polypeptide. Is additionally preferred tiiat the thnd 
encoded polypeptide is fiised to the carboxy terminus of the second polypeptide. 

In another jn'eferred embodunent, one or botii of tiie first and second polypeptides is a._ . 
monomer of one of A rentformis GFP, R. muUeri GFP, A. victoria GFP. 

hi another preferred embodiment both of the first and second polypeptides is a monomer 
of one of it reniforniis GFP, R. mutteri GFP, A. victoria GFP. 

The invention further encompasses a vector comprising a polynucleotide encoding a 
recombmant fusion polypeptide compri^g a first polyp^de pq>tide bonded to a second 
polypeptide, wherem the first and second polypeptides are found in nature as monomers of a 
multimeric protein, and wherein the reconibinant fusion polypeptide is fluorescent when exposed 
to light of an excitation wavelength or whra interactive with an excited donor fluorophore. 

6 
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The invention ftrlher encompasses a cell comprising a vector comprising a 

polynucleotide encoding a recombinant &sion polypeptide comprising a first polyp^de pqytide 

bonded to a second polypeptide, herein the first and second polypq>tides are found in nature as 

monomers of a multimeric protein, and wherein die recombinant fusion polypeptide is 

fluorescent yvhsa exposed to lig^t of an excitation wavelength or when interactive widi an 

excited donor fluorophore. 

In one embodimmt, the cell is a bacterial cell. 

h another embodiment, the cell is a eukaryotic cell. Id a preferred embodmient, the 
eukaryotic cell is a yeast cell, an insect cell, or a mammalian cell. 

The invention fiirther encompasses a pair of polypeptides comprising a polypeptide 
labeled with a fluorescent dye and a recombinant fiision polypeptide comprising a first 
polypqrtide peptide bonded to a second polypeptide, wherein the first and second polypeptides 
are found in nature as monomers of a multimeric protein, \^erein the fusion polypeptide is 
fluorescent when exposed to light of an excitation wavelength or when intraactive wilh an 
excited donor fluorophore, and wherein the fluorescent dye and the recombinant fiision 
polypeptide are fluorescent donor and accqptor to ^bother. ^. 

The invention further encompasses a pair of recombinant fusion polypeptides oomprismg 
(a) a first fusion polypeptide comprising a first polypq)tide peptide bonded to a second 
polypeptide, i^erem the first and second polypeptides are found in nature as monomers of a 
multimeric protein, and wherein tixe first fusion polypeptide is fluorescent when e3q)osed to light 
of an excitation wavelengdi or when interactive witii an excited donor fluorophore, and (b) a 
second fusion polypeptide comprismg a third polypeptide peptide bonded to a ftfurth 
polypeptide, wherein the third and fourth polypeptides are found m nature as monomers of a 
multimeric protein, and wh^in the second fusion polyp^tide is fluorescent when exposed to 
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light of an excitation wavelength or when mteractive widi an excited donor fluorophore, wfaetem 

the first &sion polypeptide and the second fbsion polypq)tide are fluorescent donor and acceptor 

to each odi^. 

In one embodimrat, each of tiie first and second fusion polypeptides further con^rises an 
additional fused (third) polypeptide^ wherein the additional fused polypq>tide of the first fusion 
polypeptide conqmses a sequence which is diffei:ent from the additional fused polypeptide of the 
second fusion polypeptide. 

Hie invention further encompasses a method of producing a fiuorescently labeled 
recombinant fusion polypeptide, the method conq)rising the steps of introducing to a cell a 
polynucleotide encoding a recombinant fusion polypq>tide conqxiising a first polypeptide p^tide 
bonded to a second polypeptide, wherrin the first and second polypqrtides are found in nature as 
monomers of a multimeric protein, and wherein the recombinant fusion polypeptide is 
fluorescent when exposed to light of an excitation wavelengtii or when inta:active witii an 
excited donor flu(m)phore, and cultuiing the cell under conditions that permit the syntiiesis of die 
recombinant fusion polypqptide, whereby the recombinant fusion polypeptide is produced. 

The invention fiuifaer encon^asses a method of labeling a ciell with a flnoiescent 
recombinant fusion polypq)tide, the metiiod comprising die steps of: a) inlzoducing to a cell a 
polynucleotide encoding a recombmant fusion polypeptide comprising a first polypeptide peptide 
bonded to a second polypeptide, wherdn the first and second polypeptides are fiiund in nature as 
monomers of a multimeric protein, and wherein the recombinant fusion polypeptide is 
fluorescent when exposed to ligjbt of an excitation wavelmgth or when interactive with an 
excited donor fluorpphore; and b) culturing tiie cdl under conditions that permit the synthesis of 
the recombinant fusion polypeptide, whereby the cell is labeled with the fluorescent recombinant 
fusion polypeptide. 

8 



wo 02/048174 PCT/US0iy48690 
Id a prefened embodiment, in the intioducing step (a), tiie polymicleotide intxoduoed to 

tibe cell furliier cdopises a sequence encoding a tfiird polypeptide fiised in frame to the sequence 

encoding the recombinant fiision polypq)tide. 

The invention further encompasses a mediod of monitoring the interaction of two 
polypeptides of interest, the mediod comprising the steps of: a) contacting a fluorescent 
polypq)tide fusion, as described above, and a second polypeptide wherdn: i) the fluorescent 
polypq)tide fusion comprises a first polypeptide of interest; ii) the second polypeptide comprises 
a second polypeptide of interest and is fluorescendy labeled; and iii) the fhiorophores conqxrised 
by the fluorescent polypeptide fusion and the second polypeptide are fluorescent donor and 
fhiorescrataccc^rtoeachothenb) exciting the don(»rfluorophore; and c)d^ecting 
fluorescent emission fiom die flunrescrait acceptor, ixiierein the emission is indicative of die 
interaction of die first and the second polypeptides of interest 

In one embodimrat, die second polypeptide conqxrises a second fluorescent polypq)tide 
fusion, as described above, wherein flie polypeptide of interest of the second fluorescent 
polyp^tide fusion is different fix)m the polypq)tide of interest of the first fluorescent 
polypqitide fusion. 

In one embodiment, the contacting step is performed in vitro. 

In another embodiment, the contacting step is performed in a cell. Inapreferred 
embodiment; the contacting comprises die st^ of introducing nucleic acid encoding the 
polypeptides to a cell. 

The invention further enconq>asses a method of screening for a compound diat modulates 
the interaction of a first and a second member of a specific binding pair, die method con^sing 
the steps o£ a) contacting a first polypeptide and a second polypq)tide in the presence and 
absence of a candidate modulator wherein: i) the first polypeptide is a fluorescrat polypq>tide 

9 
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fusion, as desoribed above, wherein flie fliird polypeptide is the first member of a specific 

binding pair; ii) fhe second polypq)tide is ftuorescently labeled and conq^rises the second 

msoA&c of a specific binding pair; and iii) the flnorophores conqnrised by the first and second 

polypeptides are fluorescent donor and accq;>tor to each other, b) exciting the donor fiuorophore; 

and c) detecting the fluorescence of the acceptor fludophDre^ wherein emission of the spectrum 

characteristic of the fluorescent acceptor indicates the interaction of the first and tiie second 

members of the specific binding pair, and wherein a change in the interaction in the presence of 

&e candidate modulator indicates that the candidate modulator modulates the interaction of the 

mranbers of ttie specific binding pair. 

In one embodiment the second polypeptide is a fluoresceaitpolypqitide fusion, as 
desoibed above, which emprises the second member of a cfpedfic binding pair. 

BRIEF DESCRIPTION OF THE HGURES 

Figure 1 shows the polynucleotide sequence of A reniformis GFP (SEQ ID NO: 1). 

Figure 2 shows the ammo add sequmce of R. reniformis GFP (SEQ ID NO : 2). 

Figure 3 shows the polynucleotide and amino acid sequences for farGFP, a humamzed R. 
reniformis GFP. Ihe polynucleotide sequence is SEQ ID NO: 3, aiidtiie amino acid sequence is 
SEQ ID NO: 4. 

Figure 4 shows a schematic diagram of a construct encoding an IDFP of the invention. 
"CMV^ refers to the cytomegalovirus promoter, "MCS** refers to a multiple cloning sequence 
and **pA*' refers to a poly(A) addition site sequence. *1n<3FF' rqiresents one monomer of the 
humanized R. renifisnnis GFP, and ''linked' refers to a pqrtide or polypeptide linker sequence. A, 
B, and C show examples of linker peptide sequences. 

Figure S shows relationships between emission and excitation peaks for donor and 
acceptor fluorophores capable of FRET. 

10 
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DETAnJSD DESOUPnON OF TEffi INVENTS 

All patents and patent applications, both U.S. and international, and all Uteratme 

publications referred to herdn aie hereby incoxporated in their endiety witliin this document by 

reference. 

pefinitions 

As used herein, a recombinant fusion polypeptide is "^uorescent when excited". 

As used herefai, the term "excited" refers to a fluorophore diat is exposed to light of an 
excitation wavdengdi or to an accq)tor fluorophore diat is interactive with an excited donor 
fluorophore. 

The phrase '"fluorescent when exdtecT means that when the recombinant fusion 
polyp^de is e3^sed to light of an excitation wavelength or iTrtim the polypqrtide interacts 
witfi an excited donor fluorophore, the polypeptide fluoresces. ^'Exposed to light of an excitation 
wavelength" means irradiated widi light (electromagn^c radiation) within a given spectrum of 
wavelengths that is absorbed by flie polypeptide sudh that the polypq)tide emits light having a 
different spectrum of wavelengths, and thus fluoresces. Fluorescent emission occurs at a longer 
wavelength dian does excitation. 

A recombinant fusion polypq>tide according to die invention has three properties: 1) it 
must emit lig^ upon irradiation with li^^t of a given wavelength or wavelengths; 2) it must have 
tiie cq>acity to form an intramolecular homodimer as defined hssKsa above; and 3) the first and 
second polypeptide monomers that constitute the fusion polypeptide cannot function as 
fluorescent donor and fluorescent accqptor, respectively, in the context of fluorescence resonance 
energy transfer. 

As used herein, the term "light of an excitation wavelength" refers to those wavelmgths 
of light that are absoibed by and excite a given fluorophore to emit fluorescence. These 

11 
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waveleng&s are described in detail herem below. Light of an qypiopriate portion of the 

spectrum is synonymous widi light within tbe excitation spectrum of a given fluorophore. 

As used herem, the tesrm ''excited donor fluorophore*' refers to a fluorophore which has 
absorbed energy witiiih its excitation spectrum. An excited donor fluorophore can transmit 
energy sufficient to excite an acceptor fluorophore. 

As used herem, the temi 'ftuorescnt dye" refers to a non-polypeptide chemical moiety 
that, upon absorption of light energy of a particular wavelength or wavdenglhs, emits lig^it at 
another wavelength or that emits lig^ wh» paired with an appropriate excited donor 
fluorophore. 

Wbsa referring to membm of a pair of fluorophores (ie., fluorescent dyes or 
polypq)tides) tihat can undergo fhu»:esc»ce resonance energy transfer (FRET), flie fluorophore 
diat ^ts at a wavelengdi or spectrum of wavelmgths that excites the other memb^ of the pair 
is re&ned to as the *'fluorescmt donor'' or '"fluorescence donor^. Conversely, the meniber of the 
pair that emits in response to excitation by the fluorescence donor is termed the '"fluorescent 
acc^tor'' or '"fluorescence acceptor^'. The members of such a pair are said to be "fluorescent 
donor and acceptor to each oth^." According to the invention, the fluorescence donor and 
fluorescence acceptor polypq)tides are not linked by peptide bonds. In one embodimmit of the 
invention, either of the fluorescence donor or acceptor, but not both, may be a norHpolypq>tide 
fluorescent dye (also not covalently linked to eadi otfa^). 

As used herein, the term ''fluorescently labelecT means, when referring to a polypeptide, 
that the polypeptide is covalently attached to a fluorescent moiety. A polypeptide may be 
fluorescentiy labeled by cbvalent attachment to a non-polypq>tide fluorescent dye, or 
alternatively, by e)q)ression as a fusion protein with a fluorescent polypeptide. 
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In nature and as used herein, a fluorescent polypeptide is distinguished fiom a 
huninescent polypeptide in that a fluorescmt polypq?tide requires an input of electromagnetic 
energy in ord^ to emit li^t, while a hmiinesceut polypqptide ^ts Hg^ in response to release 
of chraiical energy. A luminescent polypq)tide may serve as a donor of excitation energy for a 
fluorescent polypeptide (in &ct; tiiis is exactly what happcm in nature when, for e?cample, 
Jlentffaluciferaseenuts energy that excites iJcntf/flGFP). A fusion polypeptide according to tiie 
invention may or may not be luminescent 

As used hj»ein, flie tenn '•recombinant refers to a polynucleotide that has bem isolated 
from its natural environment using recombiaant DNA techniques, or synthesized, or to a 
polypeptide expressed from sudi a polynucleotide. A recombinant polypeptide may be idoitical 
to or different from a naturally occurring polypeptide, as long as it is expressed firam a 
recombinant polynucleotide. 

As used herem, the term **mDnomer^ refers to a single polypq>tide molecule that exists as 
a dfancar or heterodimer or other multimar (e.g., a trimor, quadramer, pentamer, etc.) m a 
multimeric protein. A "monomer" interacts with anotiier monomer, e.g., in a dimer, via a 
specific sequence referred to herein by the equivalent t^ms 'interaction dpi^m" ^ 
'interaction interfece-. In a "dimer^ the appropriate equivalent terms for the sequences that 
mediate the interaction are *'dimerization domain" and '*dimerization interfece." 

A monomer of a fluorescent polypeptide maybe fiiU length, for example, as tiie 
polypeptide occurs in nature, or it may be longer or shorter than the naturally occurring 
polypeptide, so long as it retains the two requisite properties. 

A recombinant fiisicm polypeptide accordmg to the invention may comprise first and 
second polypeptides which exist in nature as non-peptide-bonded monomers of a multimeric 
protein. Thus, the term *taonomer^ is used wifli respect to what is fi)undmnatu^ 
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In a fusion polypeptide according to the invention, fliese fiist and second polypeptides are 

peptide bonded and fonn a single chain polypeptide. However, die peptide-bonded first and 

second polypq)tides retain flie ability, indepeadenfly, to interact with a donor or acc^tor 

fluoiophore and fluoresce. This is believed to be a result of the intramolecular interaction of the 

monomeES and the ability of die intramolecular dimer Hm formed to be excited at an excitation 

wavelengdi of light and to act as a fiuorescmt donor or acceptor. 

As used herein, die term "linker sequence** refra to a sequmce of peptide bonded anjino 
adds that j oins or links by pqptide bonds two amino arid sequences or polypeptide domains 4at 
are not joined by peptide bonds in nature. A linker sequence is encoded inflame on a 
polynucleotide between the sequences encoding the two polypeptide domains jomed by die 
linkOT. A linker is preferably 5 to 50 amino adds in length, m<M»prefis^ 
acids in lengdL An example of linkers useful in die invention are die Gly-Ala linkers taught by 
Huston et al., U.S. PatentNo. 5^8,498, mcoiporated herein by reference. Additional useful 
linkers include, but are not limited to (Arg-Ala-Arg-Asp-Pro-Arg-Val-Pro-Val-Ala.Thr)M PCu et 
al., 1999, Proc. Natl. Acad. ScL U.SA. 96: 151-156), (Gly-Ser)„(Shao et al., 2000, Bioconjug. 
Caiem. 11: 822-826), Crhr-Ser-Pro)a(Kroon ct al.,2000, Eur. J. Biodiem. 267: 6740-6752), (Gly- 
Gly-Gly)a (Kluczyk et aL, 2000, Peptides 21: 1411-1420), and (Ghi-Lys)n(Klyczyk et aL, 2000, 
supra), wherein n is 1 to 15. 

As used herein, the term "specific binding paif ' refias to a pair of polypq)tides fliat 
physically mteract in a specific manner that gives rise to a biological activity, diat is, to die 
substantial exclusion of odi«: polypq)tides. Members of a specific bindmg pair mteract dnough 
complementary interaction domains, such fliat tiiey mteract to the substantial exdnsion of 
proteins that do not have a con5)lementary interaction doinain. Non-limiting examples of 
q)ecific binding pairs mctade antibody-antigen pairs, razyme-substrate pairs, dimeric 
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transcription fiactors (e.g., AP-1, composed of Fos specifically bound to Jiin via a leucine zipper 
interaction domain) and recq)tor-ligand pairs. 

As used herein^ tiie term '^amrno terminus" refers to Hie last amino acid at the amino end 
of a polypeptide, where the last amino add is not peptide bonded to another amino acid. 

As used herein, Ae terai "carboxy terminus*' refers to the last amino acid at the caiboxyl 
md of a polypeptide, where tiie last amino acid is not peptide bonded to another amino acid. 

As used herein, the tenn "labeling a ceU" refers to Hie expression of a fluorescent 
polypeptide in a cell, such that tfie oeD is detectable by irradiating the cell with li^t wifliin die 
excitation spectrum of the fluorescOTt polypeptide and monitoring or detecting ^nission within 
the radssion spectrum of the polypq>tide. Acellmaybelabeledbyexi»»ssionof afiuorescmt 
polyp eptide that localizes anywhere m flie cell, mchidmg, but not limited to the cell surfice, die 
cytoplasm, the nucleus or to particular organelles sudi as mitochondria, lysosomes, endosomes, 
golgi q)paratus, endoplasmic reticulum or other specific sub-celhdar locale. 

As used herein, the term *Sntroducing a nucleic add into a cell" or 'introducing a 
polynucleotide mto a cell" refers to die process whereby a recombinant polynucleotide is put into 
a cell. Mefliods for inttodacing a nucldc acid to a cdl will vary with the nature of the ce^ and 
the nature of the chosen vector, but one of skill in die art may readily select and anploy a known 
mediod appropriate for a given cell type and vector. 

As used herem, die tenn ""culturmg a cell under conditions that permit the symhesis of a 
recombinant polypeptide" refers to the maintraiance of cells comprising a polynucleotide 
encodmg a recombinant polypeptide m growdi medium and under environmental conditions 
(e.g., tempearatm^, pH, redox and osmotic conditions, O2 and CO2 concentrations and presoice or 
absence of an effective concentration of an qipropriate expression-modulating agent such as 
IPTG or tetracycline) condudve to tiie syntihiesis of die recombmant polypeptide. One of skill in 
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file art is assumed to be capable of mamtaining yeast, insect, mammalian or other cells under 
conditions that peormit the syndiesis of a recombinant polypeptide according to tiie invention. 

As used herein, the term "Monitoring die mtoracdon'* re^ to the process whereby Ihe 
physical association of two polypq)tides or a polypeptide and another entity are measured. As 
relates to the invention, die term refers most fiequendy to detection or measurem^ of 
association or interaction using FKBT. 

As used herein, die term "Mtramolecular dunef' refers to a dimer formed by the covaleat 
peptide linkage oftwo polypeptide monomers. An '^itramolecular dimer fluorescent proteuT 
(IDFP) is an intramolecular dimer m which die linked polypq>tides which exist in nature as 
monomers of a multimeric protein are fluorescent polypeptides. According to the mvention, die 
linked monomm of an n>FP are not fluorescent donor aiul acceptor to eadiodiOT. An IDFP 
fusion protein"* is an IDFP which is fused to a protein of interest or to a fragment of a protein of 
interest 

As used herdn, die tenn ''protein of interest ' ref^ to a polypeptide, or a domain 
(firagment) of a polypeptide, that is selected to be fused to an IDFP. Any polypeptide or 
fiagment of a polypq>tide for whidi a polynucleotide sequence is known can be fiised to an 
IDFP by standard techniques known m the art A protemofinterest according to die invention 
eidier does not alter the fluorescence characteristics of the fused IDFP, or, if it does alt^ those 
characteristics, the alteration is such diat the alteration does not intediro with die intended use of 
the IDFP fusion protein. 

As used herein, die term ^'detecting fluorescence*' refers to die process whereby the 
fluorescent emission by a fluorescent polypq)tide is measured or det^nuiied. Fhiorescence 
detection mediods include quantitative and qualitative methods adapted for standard or confocal 
imax>scopy, F ACS analysis, and diose adapted for high throughput mediods involving multiwell 
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plates, airays or mi(aroarrays. One of skiU iB liie art can select appropiiate fflte 
excitation energy sources for Ate detection of fluorescent ^nission fiom a given fluorescent 
polypeptide or dye. 

As used herein, the tom ""candidate modulator" refers to an agpnt being evaluated for its 
efifect on the function of a polypeptide or the interaction of members of a specific binding pair. 
Exemplary sources and types of candidate modulators useful according to the invention are 
described herein below. 

As used herein, tiie t^m ^change in interaction" or **modulation of infection" refers to 
an increase or decrease in die level of interaction detected between members of a specific 
bmdingpair. As used herem, the levdofinteraction is considered increased if the detect^ 
interaction goes up by at least 10%, and preferably by 20%, 35%, 50%^ 75%, or more, up to and 
including 2-fold, 5-fold, 10-fold, 20-fold, 50-fold or more relative to a standani As used 
hoein, die level of interaction is considered decreased if the detected interacticm goes down by at 
least 10%. and preferably by 20%, 35%, 50%, 75%, 90%, 95%, 98%, 99% or more, up to and 
including 100% (no interaction) relative to a standard. 

As used herein, the term '^^e polypeptide diain" refers to a polypeptide chain in 
which all amino acids are linked sequentially by peptide bonds. A ''single polypqitide chain*" is 
one generated by translation of a single mKNA template and may oacompass one or more 
polypeptide domains, inchiding one or more r^eats of the sequence conqorismg one polypeptide 
or polypq)tide domain. 

As usedherefai, the term '^lypeptide domain" refers to a sequence of amino adds tiiat 
exhibits one or more discrete binding or functional properties. As used herein, binding or 
functional properties include binding to one or more polypeptides, modulation of the binding of 
one or more polypeptides, recognition by an antibody or antigen binding firagment thereoi^ 
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binding to a coenzyme, ion, or ofha: ligand, catalytic activity or inhibition of catalytic activity, 
fluorescence and Imninescnce. In this context, non^limiting examples of polypeptide domains 
include a DNA binding domain and a kinase domain. 

As used herein, the term 'liomodimer^ refers to a protein complex coniprised of two 
identical copies of the same monomer. 

As used herein, the term "interact means fliat two molecular species physically assodate 
with each otiier. The association tiiat is diaiacterized as an interaction can invol^^ 
diarg^ mteractions, charge-dipole urtCTactions, dipole-dqwle interactions, van der Waals forces, 
hydrogen bonding and/or hydrophobic forces. 

As used hefein, Ifae term 'Specific binding^ means Ifae specific recognition of one of two 
different molecules for the otii^ compared to substantially less recognition of otiier molecules. 
Members of a specific binding pair have a particular afBnity for each otiier tiiat gives rise to a 
biological activity. Generally, tiie molecules have areas on tiieir sur&ces or in cavities giving rise 
to specific recognition between the two molecules. Exemplary of specific binding are antibody- 
antigen interactions, enzyme-substrate intmctions, polynucleotide interactions, and so fortit 

As used herein, the tram ''specifically dimerize" means that two monomer^iiiffefiil in the 
invention interact via an interaction domain present on each monomer, to the substantial 
exchision of polypqitides lacking fliat interaction domain. ''Specifically homodimerize" means 
tiiat tiie monomers tfiat interact via a shared interaction domain, to tiie substantial exchision of 
polypeptides lacking that interaction domain, fomi a homodimer as defined herein. "Substantial 
exchision*' means tiiat at a given time in a sample, less tiian 0.1 % of the monomraSy.and 
preferably less than 0.01%, 0,001% or fewar monomers are physically associated witii 
polypeptides tiiat do not have a complementary interaction domain. 
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As used hexein, the temi "^varianr lefets to a polypeptide diat differs in amino add 

sequence fiom a parent polypeptide yet retains liie fimction of the parent polypq)tide. A variant 

fluorescent polypeptide may, for example have one or more amino acid msertions, deletions or 

substitutions that do not alter ability of the polypeptide to emit fluorescence iq)on excitation or 

interaction with a donor or acceptor fluoxophote. A variant jQuorescmt polypeptide according to 

the invention has the ability to form an intramolecular homodimer as defined h^:ein. 

As used herein^ the t^m '"d^ved fiom" refers to a polypq)tide that differs in amino acid 
sequence finm a ref^ence polypeptide used as the teEiq>late or starting sequence fbr generating 
or deriving the differing sequence. For example, a fluorescent polypq>tide can be derived fix>m a 
wild-type fluorescmt polypqrtide (i.eu, a refiarence polypeptide) by random or site-directed 
mutagenesis, including insertions, delrtions or truncations or fusions. A fluorescent polypeptide 
derived firom a wild-type polypeptide can have different fluorescence characteristics than die 
wild-type polypeptide. 

As used hmin, the torn '^uorescence characteristic" refers to a property of the 
excitation or emission by a fluorescrat polypeptide. Fluorescence characteristics include, for 
exan^le, the waveleng1h(s) at wMch a fluorescent polypeptide is excited or at yMch it emits 
(including tiie breadth and amplitudes of tiie spectra for each), the extinction coefScient or 
intensity of die emission, quantum yield or the efSdracy of emission, and resistance or 
susceptibility to photobleacfaing. Table 2 provides examples of excitation maxima, emission 
maxima, extinction coef&dent and quantum yield for a variety of fluorescrat polypeptides. 

As used herein, the temi '^spectrum characteristic of a fluorescent accep W refers to the 
emission spectrum of a given fluorpphore that is being used as the fluorescence acceptor in an 
acceptor/donor pair. 
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Detailed Description of Ihe Invention 

In one aspect, the inyention relates to dimeric fluorescent proteins that avoid the 
problems caused by heterodimerization. In this aspect, heterodjmeiizatiQn is avoided by fosing 
two monomers of the fluorescent polypeptide using a linker peptide. The close spatial 
relationship of flie fused monomers strongly favors the formation of a dimer b^een die two 
fiised monomers, to the essential exclusion of other monomers sharing a similar dimerization 
inter&ce. Theinteraction ofthe fused monomers via their respective dimerizati^^ 
refi^rod to herein as Intramolecular dunerization". An intramolecular duner fhunescent protem 
(IDEP) does not conpdse fluorescent monomers that are related to each other as donor and 
acceptor. That is, the monomers that are linked in an IDFPcamiot undergo FRE^ 
IDFPs may be co-expressed within ttie same cell or otherwise mixed widi distinct fluorescent 
proteins comprising the same fhiorescent protem dimerization inter&ces without encountering 
the problems caused by heterodim^ formatioa 

In order to make an IDFP, the nucleic add encoding a monomer of a fluorescent protein 
is joined in flame at its 3' end to a sequence encoding a peptide linker, which is itself joined in 
frame to another copy of the nucleic add encoding die mcmpmer. This sequence may and often 
will be additionally hnked in frame to a sequence ncodmg a polypeptide of interest, for 
exanople, a polypeptide being investigated for interaction with another protein. Translation of 
- Hie mRNA encoded by such a nucldc add construct generates fhe fluores cexxt monomers in such 
close proximity to each other that intramolecular homodimerization of the monomers is very 
strongly &vored over intennolecularheterodim^izatioiL The resulting polypeptide therefcm 
comprises an intramolecular homodimer of the fluorescent protein monomers, fiised to a protein 
of interest 
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Fluorescent Proteins Useful According to the Invention 

Any fhioiescent protein tliat homodiinerizes in a cell canbe usefiil in generating an IDFP 
of the invention. GFPs StomAequorea victoria, Renilla renffomds ondRenitta mulleri^ among 
others, are homodimers as they exist in nature. Any oftiiese proteins, and any mutants or 
engineered versions of lliese proteins ttiat retain Ihe ability to homodimerize may be used to 
generate an IDFP of the invention. 

In ord^ to genmte an IDFP according to the invrntion, the fhiorescent protein or the 
natural protein it was derived from (e.g.,i&. /iOT>&mwGFP)miistformhomodimers^*en 
esqxressedinamonomericfbrm. It is g^erafly known in the field wheth^ a given protein exists 
asahomo-orhetmdimfirinvivoorifitha8diecq)acitytohoniodimmze. In the event that 
such knowledge is not available, there are a number of ways m which one of skin in tiie art may 
determine whether a particular fluorescent protein homodimerizes. First, biophysical methods 
such as X-ray crystallogr^hy, nuclear magnetic resonance, radiation target analysis or mass 
spectrom e try can be used to determine whetii^ a polyp^tide dimerizes. 

A biochemical ^proach is to ftactionate samples of purified proteins by size selection 
g^l chromatography under denaturing versus non-denaturing conditions and analyze fractians for 
the fluorescent protein by fluorescence. If flie fluorescent protein migrates at a larger size 
(approximately twice as lar^) under non-denatuiing conditions relative to denaturing conditions, 
it is an indication that the protein is a dhner under native conditions. Bxamples of commonly 
used matrices include, for example, Sephadex (G10-G200), Bio-Gel (P-2 - P-300) and 
Sepharose (2B» 4B, etc.) matrices. One of skill in the art may readily select a size separation 
matrix appropriate for such analyses. If performed with purified protein this method can indicate 
whether or not a polypeptide homodimerizes. If the method is applied to non-purified protein. 
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for example, to protein extracts, the assay only indicates that a dimer fonns with some 
polypeptide and fuither analysis is lequiied to d^emune if tbe dimer is a homodimer* 

Anotiier biochCTucal method of investigating dimer formation is to generate a truncated 
or elongated form of the protein and mix it, eiflier by co-expression or by mixing of isolated 
proteins, Willi the wild*type protein. If hoxaodimeirs can form, there will be fliree distinctly sized 
bands following native g^l electrophoresis: 1) a homodimer of the wild-type; 2) a homodimer of 
the elongated or truncated form; and 3) an intermediate-migrating diagnostic heterodimer 
complex of the wild-^e and the tnmcated forms. In the absence of dimerization, only bands (1) 
and (2) will &snxL 

Additionally, homodim^ formation is detected by the method of analytical 
ultiacmtrifagation (Baird et al., 2000, Proc Nad Acad Sci U S A., 22: 1 1984-9). 

Exanq»les of known fhioresc^t proteins that can be expressed as intramolecular dimers 
are as follows. SEQ ID NO: 1 (Figure 1) is the nucleotide sequence encoding wild-type rGFP, 
and SEQ ID NO: 2 (Figure 2) is the amino add sequence of wild-type rGF^^ Aprefmed 
embodiment of the IDFP conq)rises two copies of the wild-type rGFP polypeptide, linked by a 
pq>tide linker sequence. Another embodiment enconq)asses the same rGFP IDFP additionally 
fiised in frame to a protein of interest Anyproteind^vedfiromtherGFPof SEQIDNO: 2can 
be used to generate an IDFP ofthe invention as long as it retains the abihty to hoi^ Li 
a preferred embodiment, the polynucleotide sequence encoding a fhioresc^t polypeptide (e.g., 
rGFP of SEQ ID NO: 2) is a humanized polynucleotide rGFP coding sequence, also referred to 
herein as hrGFP. Figure 3 shows a humanized polynucleotide sequence (hrGFP) and the rGFP 
sequence it encodes (SEQ ID Nos: 3 and 4, respectively). 

The amino acid and nucleotide sequences of A. victoria GFP are known in the art 

(Prasher et al., 1992, supra) and vectors racoding a variety of mutant >4. victona-derived GFPs 
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are also known and are ftequenlly commercially available. For example, Heim et al. (1995, 

Nature 373: 663-664) teadies mutations at S65 of i4. victoria GP? tiiat enhance Ae floorescence 
intensity of the polypeptide. Hie mutant containing flie S65T mutation is particulariy important, 
since its fluorescence is ^proximately 35 times as intaise as wfld-typCil \nctoria GFP, and its 
emission spectrum is shifted to the red, makmg it more amenable to standard Aodamine optics 
(excitation and emission maxima at 489 nm and 508 nm, respectively). An S65T mutant 
encoded by a construct comprising humanized codons is known as EGFP, or "enhanced CTF' 
(available ftom CSLONTECH; see GenBank Accession No. U4328^. 

The EGFP mutant is die cornerstone of a series of commertaally-available CTP mutants 
that have differing emission specbca. and other nseM ^gineered properties (Cormack et al., 
1996, Gene 173: 33-38; Yang et aL, 1996, Nud. Acids Res. 24: 4592-4593; Crameri et al., 1996, 
Nature Biotecbnol. 14: 315-319) Each protein in the series contams nratations in addition to tiie 
S65T and humanizing nmtations, that altar die emissiwi characteristics of the protdns. For 
example, the cyan fluorescent protein known as ECFP contains six mutations ttiat shift tiie 
anission to cyan Vl^A (excitation and emission maxima at 434 nm and 477 nm, respectively, see 
GenBank Accession No. AB041904 and Sawano et aL, 2000, NucL Adds Res. 28: e780,. The 
blue fluorescent protem known as EBFP contains four nmtations that shift die onission spectnun 
to bhie (QccitaticHi and emission maxima at 380 nm and 440 nm, respectively). The yellow 
fluorescent protein known as EYFP (see Qraao et al.. Science 273: 1392-1395, clone GFF-IOC) 
contains mutations shifting tiie emission to yellow-grerai (exdtatimi and emission maxima at 514 
nm and 527 nm, respectively). EGFP, EC3PP, EYFP and EBFP are aU available fiom 
CLONTECEL 

The S65 site has received considerable scrutiny for its role m detetmining the 
fluorescence diaracteristics of the A victoria GFP molecule. Additional mutants at S65 include, 
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for example, S65A, S65C and S65L, each of which have excitation and emission maxima Aat 
• difEBrftomwUdrtypeiiviawioQFP(seeTable2). The nucleotide sequence encoding an S65 A 
mutant is available as GenBank Accession No. U56996. One skilled in flie art can iairoduce 
mutations necessary to alter S65 to any desired amino acid. Similarly, the additional point 
mutations d^ed in Table 2 can be generated by one of skill in tiie art 

Oth» floorescent proteins useful according to ftie invention include, tot txmplo, A. 
victoria-dermd GEPs that are optimized for expression in plants (GenBank Accession No. 
U87625 and WO 96/27675), are less Iheimosensilive (GenBank Accession No. U87973), or are 
more soluble and emit blue fluorescraice (GenBank Accession No. U70497). A. victoria GFPs 
targeted to specific organdies have also been described, such as tiiose targeted to fl» 
mitochondria and flie nucleus (Rizzuto et aL, 1996, Cuir. BioL 6: 183-188). This listing is by no 
means exhaustive. There are, fijr exan^le, a numbo: of fluorescent protein variants, both derived 
fiom ^ Wcrorto and ftom ofter sources, fliat have been reported in or are the STibje^ 
mtemational patents and patent applications, for example, U.S. Patent Nos. 6,124,128, 
6,066,476, 6,020,192, 5,804,387, 5,874,304, 5,968,738, 5,625,048, and 5,777,079, and PCT 
i^licationNos. WO 98/21355, WO 98/00737, WO 97/20078, WO 97/42320 and WO 
97/1 1094. Fluorescent prot^ variants are also described in a number of additional publications 
in 4e scientific literature, including, for example, Ehrig et aL, 1995, FBBS Lett 367: 163-166); 
Surpin et al., 1987, Photochem. PhotoWol. 45 (Suppl): 95S; and Delagrave et aL, 1995, 
BioTechnology 13: 151-154. Any and all of tiie fluorescent proteins taught in these sources and 
elsewhne are candidates for the generation of IDFPs of die invention, provided that diey 
homodimerize and the sequences encoding them are known. 

The red fluorescent protem fiom the Ihdo-Padfic sea anemone of flie Discosoma species 
is also a candidate fi>r IDFP generation according to die invention (see MaXz et al, 1999, Nature 
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BiotecbnoL 17: 969-973). The sequence eocoding the piotein, known as 'DsRed** is available at 
GeoBank Accession No. AF27271 1, and vectors encoding the piotein aie conunecdally 
available (C3-0NTECH). 

Linker Sequences Usefhl Acoording to the hvention 

Linker sequences usefiil according to the invention serve to join monomers in the dimeric 
fluorescent polypeptides of the invention. A linker is preferably about S to about SO amino acids 
in Iragdi, and more preferably about 10 to about 20 annno acids in length. An example of linkers 
useM in the invention are the Gly-Ala linkeis tau^t by Huston et aL, U.S. Patent No. S;2S8,498, 
incorporated herein by referaice. Additional useful linkers include, but are not limited to (Aig-- 
Ala-Aig-Aq>-Ito-Aig-VaI-Pro-Val-Ala-Thr)i4 (Xu et al., 1999, Proc. NatL Acad. Sci U.S.A. 
96: 15M56), (Gly-Ser)ta(Shao et al., 2000, Bioconjug. Chem. 11: 822-825), (nir-Ser-Pro)a 
(Kroon et al., 2000, Eur. J. Bioch^L 267: 6740-6752), (Gly-Gly-Gly>i (Khiczyk et aL, 2000, 
Peptides 21: 1411-1420), and (Ghi-Lys)B(Klyczyk et al., 2000, supra), whwein n is 1 to 15 (each 
of the {Hfeceding references is also incorporated herein by reference). 
Proteins of Merest 

Frequently it will be advantageous to express an IDFP of the invmticm as a fusion with a 
protein of interest The protein ofinterest can be any protein for ^ch the nudeic ad 
is known and for which tbat sequence or at least a relevant part of that sequence can be cloned 
into a vector encoding an IDFP. By relevant part is ineant a dotnainoftnterestwitfain a protein, 
for example, a domain bdng evaluated for protein**proteih interactions or a domain with catalytic 
activity. As used herein, the term **pn)tem of interest or ''domain of interest refers to any 
polypqrtideQrproteiD) or polypeptide or protein domain, that one wishes to fiise to an IDFP 
molecule of the invention. The fusion of an IDFP with a polypeptide of interest may be through 
linkage ofthe IDFP sequence to dther the N or C terminus of the fusion pa^ Fusions 
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comprising IDFP pol^qptides of the invention need not comprise only a sin^e polypqitide or 
domain in addition to the IDFP. Rather^ any nunberofdomaiiisofinteTestniay be linked in any 
way as long as the IDFP coding tegion retains its reading frame and Ihe encoded polypeptide 
retains floorescence activity under at least one set of conditions. One non-limiting example of 
such conditions includes physiological salt concentration (i.e.» about 90 mM)» pH near neutral 
and 37*^0. 

Exanq)les of proteins of interest include^ but are not limited to rec^tors (transmembrane 
and intracellular) and cell sur&ce protehis^ growth fectors, signal transduction proteins, 
transcripticm &ctors, structural proteins (e.g., cytoskeletal proteins, nuclear matrix proteins, 
histones, etc.), extracellular matrix proteins, immunoglobulins, bacterial proteins, plant proteins, 
viral or phage proteins, enzymes, therq>eutic proteins, phosphcqxroteins, ^tycofroteins, and 
lipoproteins. 

Production 9f Intram olecular Dfaner Fluoresca^t Prnteins 

The production of IDFPs from recombinant vectors may be effected in a number of ways 
known to those skilled in the art. For example, plasmids, bacteriophage or viral vectors may be 
introduced to proksiryotic or eukaryotic cells by any of a number of ways known to those skilled 
in ^ art Examples of useful vectors, cells, m^ods of introducing vectors to cells and methods 
of detecting and isolating GFP polypeptides and variants thereof are also described herein below. 

1. Vectors Useful Accoiding to the bventtoxL 

There is a wide array of vectors known and available in the art that are useful for the 
expression of IDFPs according to the invention. The selection of a particular vector clearly 
depmds upon the intended use of Ifae polypeptide. For example, the selected vector must be 
capable of driving expression of the polypeptide in tiie desired cell type, whether that cell type be 
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prokaryotic or eukaryotic. Many vectots ooixi(his6 sequences allowmg both prokaryotic vector 

replication and eukaiyotic esqiression of operably linked gene seqaeoces. 

Vectors useful according to the invention may be autonomously replicatmg, that is, the 
vector, for example, a plasmid, exists extracfaromosomally and its replication is not necessaiily 
directly linked to the rq>licatLon of the host cell's genome. Altemativelyy the r^lication of the 
vector may be linked to die replication of the host's chromosomal DNA, for example^ the vector 
may be integrated into the chromosome of the host cell as achieved by retroviral vectors. 

Vectors useful accordmg to the invention preferably conqmse sequences operably linked 
to tiielDFP coding seq[a6nces that permit the transcription and transit . 
Sequmioes that permit the transcr4>tion of the linked IDFP sequence include a promoter and 
optionally also include an enhancer element or elements permitting the strong expression of due 
linked sequences. The term **transcrq)tional regulatory sequences" refets to the combmation of a 
promoter and any additional sequences confoiing desired e?q>ression diaracteristics (e.g., high 
level expression, inducible e?cpression, tissue- or ceU^ype-specific expression, or a combination 
of these) on an opanbly linked nucleic acid sequence. 

The selected promoter may be any DNA. sequmce diat exhibits transcriptional activity in 
the selected host cell, and may be derived fiom a g»e normally eiqpressed in die host cell or 
fiom a gene normally expressed in other cells or organisms. Examples of promoters include, but 
are not limited to die following: A) prokaryotic promoters - E. coli lac» tac» or tip promoters^ 
lambda phage Pr or Pl promoters, bacteriophage T7, T3, Sp6 promoters, B. subtiKs alkaline 
protease promoter, and Hie B. steaxodiermophilus maltogenic amylase promoter, etc.; B) 
eukaryotic promoters - yeast promoters, such as OALl, GAL4 and odier glycolytic gme 
promoters (see for example, Hitzeman et al., 1980, J. Biol. Chem. 255: 12073-12080; Alber & 
Kawasaki, 1982, J. Mol. AppL Gm. 1: 419-434), LBU2 promoter O^lartinez-Ganaa et al., 1989, 
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Mol Gea Geoet 217: 464-470), alcohol dehydrogenase gene piomoteais (Young al^ 1982, in 
Genetic Engineering of Miaooiganisms fiar CSiemicals, Hollaender et al*, eds.» Plenum Press, 
NY), or the TPIl promote: (U.S. Pat No. 4,599^1 1); insect promoters, such as the polyhedrin 
promoter (U.S. Pat No. 4,745,051; Vasuvedan et al., 1992, FEES Lett 311: 7-11), the PIO 
promoter (Vlak et aL, 1988, J. Gen. Virol. 69: 765-776), the Autographa califomica polyhediosis 
vims basic protem promoter (HP 397485), &e baculovirus immediate-early gene promoter gene 
1 promoter (U.S. Pat Nos. 5,155,037 and 5,162,222), the baculovirus 39K delayed-early gene 
promoter (also U.S. Pat Nos. 5,155,037 and 5,162,222) and the OpMNPV immediate early 
promoter 2; TnamTnaltan promotes - the SV40 promoter (Subramani et at, 1981, Mol. Cell. BioL 
1: 854-864), metallothionein promoter (MT-1; Pabniter et aL, 1983, Science 222: 809-814), 
admovirus 2 major late promoter (Yn et aL, 1984, NucL Adds Res. 12: 9309-21), 
cytomegalovirus (CMV) or other viral promoter (Tong et al., 1998, Anticancer Res. 18: 
7 1 9-725), or even the endogenous promoter of a gene of interest in a particular cell type. 

A selected promoter may also be linked to sequences rradering it inducible or tissue- 
specific. For example, the addition of a tissue-specific enhancer element upstream of a selected 
promoter may render the pnnnoter more active in a given tissue or ceU type Alternatively, or in 
addition, inducible esqsressionmay be achieved by linking the promote to any of anmnber of 
sequence elemmts permitting induction by, for example, thermal changes (ten9)erature 
sensitive), chemical treatment (for exanqile, metal ion- or IPTG-inducibleX or the addition of an 
antibiotic inducing agent (for exanq)le, tetracycline). 

Regulatable expression is achieved using, for example, e3q[>ressiQn systems that are dmg 
inducible (e.g., tetracycline, rq>amycin orhormono-indudble). Dmg-regulatable promoters that 
are particukurly well suited fijr use in mammalian cells include the tetracycline regulatable 
promoters, and glucocorticoid stmid-, sex hormone stenrid-, ecdysone-, lipopolysaccharide 
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(LPS)- and isopropylMogalactoside (IPTG)-regulatable promotes. A legulalable eoipressioii 

system for use in mnnmiaiian cells should ideally, bfut not necessarily, involve a transcriptianal 
regulator ftat binds (ot Ms to bind) nonrnammalian DNA moti& in le^KHise to a regulatory 
agent, and a regulatory sequence that is reqwnsive only to this transcriptional regulator. 

Tissue-specific promoters may also be used to advantage with DDFP-enooding constructs. 
A wide variety of tissue-specific loomoters is known. As used harran, the term "tissu^-qpecific" 
means that a given promoter is transoiptionally active (i.e., directs the expression of linked 
sequences sufficient to permit detection of tte polypeptide product of the promoter) in less than 
all cells or tissues of an raganism. A tissue speaSc promoter is preferably active in only one cell 
^pe, but mxy, for exanq>le, be active in a particular class or lineage of cell types (e.g, 
hematopoietic cells). A tissue specific promotarusefiJ accocdingto the inventitm comprises 
those sequences necessary and sufficient for flie e3q>ression of an oparably linked nucldc add 
sequence in a manner or pattern ttut is essentially the same as the nuimer or pattern of 
expression of die g«ie linked to that promoter in nature. Any tissue specific transcriptional 
regulatory sequence known in the art may be used to advanta^ with a vector Micoding an IDFP. 

. Iaadditi<m to promoter/ehhancer elonents, vectors usefiil according to flieJuivBntiQn may 
furflicr conqirise a suitable terminator. Such tetmmators include, fijr example, the human gtowfli 
hcmnone tenninator (Pahniter et aL, 1983, siqjra), or, for yeast or fungal hosts, the TPIl (Alber & 
Kawasaki, 1982, supra) or ADH3 terminator (McKnigJit et aL, 1985, EMBO J. 4: 2093-2099). 

Vectors useM according to the invention may also conqmse polyadenylation sequences 
(e.g., the SV40 or AdSBlb poly(A) sequence), andtranslatianal oihancer sequences (e.g., diose 
fiom Adenovirus VA RNAs). Furfliar, a vector useful according to dw invaation may oicode a 
signal sequence directing the recombmant polypeptide to a particular cellular conq>artment or, 
attasativdy, may encode a signal directing secretion of the recombinant polypqptide. 
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A vector useful according to the inventicm may also comprise a selectable maiker 

allowing identification of a cell tiiat has received a functional copy of the IDFP-encoding gene 
construct In its simplest form, the IDFP sequence itself Mated to 
considCTed a selectable marker, in that iUumination of cells or cell lysates with the proper 
wavelength of Ugjit and measurement of emitted fluorescence at the expected wavelength allows 
detection of cells tibat e^qjress the IDFP construct In other forms, flie selectable maricor may 
comprise an antibiotic resistance gene, such as the neomycin, bleomycin, zeocm or phleomycin 
resistance genes, or it may comprise a gene whose product conq)lem«rts a defect in a host cell, 
such as die gene encoding dihydrofolate reductase (DHER), or, for example, in yeast, die Leu2 
gene. Altesmatively, lie selectable inaiker may, in some cases be a luctf^ 
chromograiic substrate-converting «izyme gene sudi as Ihe p-gplactosidase gene. 

IDFP-encoding sequences according to die invention may be eoqiressed eidier as free- 
standing polypeptides or as fusions widi other polypeptides. It is assumed Aat one ofsldll in the 
art can, given an IDFP nucleic acid sequence, readily construct a gene comprising a sequence 
encoding the IDFP fiised in firame to one or more polypq>tides or polypeptide domains of 
interest Referees teachmg mediods to do so mclude Sambrook et aL, 1989, Molecular 
nnfiin p A Laboratory Manual. Cold Spring Harbor Laboratory Press, Cold Spring Harbor, NY, 
and Ausubel et al. (eds.), 1993, Cnnrent Protocsols in Molecu lar Bioloev on C1>-R0M> John 
Wiley & Sons, New York, NY. 

A schematic diagram of a vector encoding the transcription unit of one possible 
embodiment offliemvMition is shown in Figure 4. M this embodiment an mlramoleculardimer 
humanized R. reniformis GFP (hrOFP) is encoded on a constract driven by liie strong CMV 
promoter and containing a multi-cloning site (MCS) downstream of ifae second, or C-terminal 
copyofhrGFP. Ageneofint^^stisfcsedattheC-terminusofthehrOFPdimerbymse^ 
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frame into the MCS. A polyadenylation site sequence is included 3* of the MCS to oahance flie 
stabiUty and processing of flietraiis<arq?tgene^ The (Gly4Sa)MliiJkets shown lep^ 
three examples of a linker pq?tide sequence nseful according to the invention and are not meant 
to be limiting. 

a. Plasmid vectors. 

Any plasmid vector that allows expression of an IDFP coding sequence of the invention 
ma selected host ceU type is acceptable for use according to the inventi^^ A plasmid vector 
useful in the inventira may have any or all of the albove-noted diaracteristics of vectors useful 
according to the invmtioa Plasmid vectors useful according to the inveaitioninchide, but are not 
Ihnited to the following examples: Bacterial - pQE70, pQE60, pQE-9 (Qiagpn) pBs, phagescript, 
psiX174, pBhiescript SK, pBsKS, pNHBa, pNH16a, pNHlSa, pNH46a (Stratagpne); pTrc99A, 
pKK223-3, pKK233-3, pDR540, and pMT5 (Pharmacia); Eukaryotic - pWLaeo, pSV2cat, 
pOG44, pXTl, pSG (Stratagene) pSVK3, pBPV,pMSG, and pSVL (Pharmacia). However, any 
other plasmid or vector may be used as long as it is replicable in the host 

b. Bacteriophage vectors. 

Thare are a numba of wcffl known bacterippbage-derived 
invention. Foremost among these are flie lambdarbased vectors, sudi as Lambda Zap II or 
Lambda-Z^ Express vectors (Stratagene) that allow inducible expression of tiie polypeptide 
encoded by die insert Others mcludefflamentous bacteriophage sudi as the Ml 3-basedft^ 

of vectors. 

c. Viral vectors. 

A number of diflfermt viral vectors are useful accordmg to the invention, and any viral 
vector that permits flie introduction and expression of sequences encoding an IDFP in cells is 
acceptableforuseinthem^odsofthemv«tion. Vnal vectors that can be used to deliver 
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foreign nucleic acid into cells include but are not limited to xetrovual vectors, adenoviral vectors, 

adeno-associated viral vectors, hetpesviral vectors, and Semliki forest viral (alphaviral) vectors. 

Defective retroviruses are well characterized for use in gene tcansfo (for a review see lASkac, 

A.D. (1990) Blood 76:271). Protocols for producing recombinant retroviruses and for infecting 

cells in vitro or in vrvo with such viruses can be found in Ansubel et al. (edsO» 1993, supra, and 

other standard laboratory manuals. 

In addition to r^ioviral vectors, adenoviruses can be manipulated such that they encode 
and express a gene product of interest but are inacdvated in terms of their ability to repUcate in a 
normal lytic viral life cycle (see for example Bedcner et al., 1988, BioTechniques 6:616; 
Rosenfeld et al., 1991, Science 252:431-434; and Rosenfeld et aL, 1992, CeU 68:143-155)* 
Suitable adenoviral vectors derived from the adenovirus strain Ad type 5 dl324 or oflier strains of 
adenovirus (e.g., Ad2, Ad3, Ad7 etc.) are well known to those skilled in the art 
Adeno-associated vuus (AAV) is a naturally occurring defective virus tibat requires aiK>ther 
virus, such as an adenovirus or a hecpes virus, as a helper virus for efGdent replication and a 
productive life cycle. (For a review see Muzyczka et al., 1992, Curr. Topics in Micro, and 
bnmunoL 158:97-129). An AAV vector such as that desccihed injnraschm et al. (1985, MoL 
CelL Biol. 5:3251-3260) can be used to introduce nncldc acid into cells. AAV vectors are usefiil 
for die introduction of nucleic add sequences into a varied of different cell types (see, for 
example, Hermonat et al., 1984, Proc. Nafl. Acad ScL USA 81 : 6466-6470; and Trascfain et aL, 
1985, Mol. CelL BioL 4: 2072-2081). 

Fmally, the introduction and escpression of foreign ^es is often desiced in insect cells 
because hig^ level expressim may be obtaiQed, the culture conditions are siniple relative to 
mammalian cell culture^ and the post-translational modifications made by insect cells closely 
resemble those made by mammalian cells. For the introduction of foreign DNA to insect cells, 
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such as Drosophila S2 cells, infectioii with bacdovirus-vectors is widely used. Oflier insect 

vector systems include, for exan^l^ the expression plasmid pIZ/VS-His OhVitrogen) and other 

variants oftfaepIZ/V5 vectors encoding other tags and sdecte^ Bisect cells are 

leadily transfectable using lipofection reagents, and there are lipid-based transfection products 

specifically optimized for (be transfection of msect cells (for ^caniple, from PanVera). 

2. Host Cells Useful According to the Invention. 

Any cell into which a recombinant vector carrying an IDFP sequence may be introduced 
and wherein Ae vector is permitted to drive the e3q)ression of the IDFP is useful according to ttie 
invention. That is, because of the wide varirty of uses ftnr the IDFP molecules of the invention, 
any cell in which an IDFP molecule of the invention may be expressed and preferably detected is 
a suitable host Vectors suitable for the introduction of IDFP-encoding sequences to host cells 
fiom a variety of different organisms, both prokaryotic and eukaryotic, are described herein 
above or known to diose skilled in the art 

Host cells may be prokaryotic, such as any of a number of bacterial strains, or may be 
eukaryotic, such as yeast or other fimgal cells, insect or amphibian cells, or m a mmali a n cells 
inchiding, for example, rodent, sunian or human cells. Host cells may alsQ;be plant cells. Cells 
expressing IDFPs may be primary cultured cells, fiir example, primary human fibroblasts or 
kmtinocytes, or maybe an established ceU line, such as NIH3T3, 293T or CHO cells. Further, 
mammalian cells useful for eaqjression of IDFPs may be phenotypically normal or oncogenically 
transformed. It is assumed that one skilled in tiie art can readily establish and maintain a chosen 
host cell type in culture. 

3. Introduction oflDFP-Encodmg Vectors to Host Cells. 

IDFP-encoding vectors may be introduced to selected host cells by any of a number of 
suitable methods known to those skilled in the art For example, IDFP constmcts may be 

33 



wo 02/048174 PCTAJSOl/48690 
introduced to appropri ate bacterial cells by infection, in tiie case of E. coli bacteriopbagp vector 

paitides such as lambda or M13» or by any of a number of transformation methods for plasmid 

vectors or for bacteriophage DNA. For exainpl^ standard calcinm-chlorido-mediated bacterial 

transformation is still conmionly used to introduce naked DNA to bacteria (Sambrook et al., 

1989, supra), but electroporation may also be used (Ausubel et al. (eds.), supra, 1993). 

For the introduction of JDFF-^coding constructs to yeast or other fungal cells, chemical 
transfonnation m^ods are gena:ally used (e.g. as desoribed by Rose et al., 1990, Methods in 
Yeast Genetics. Cold Swing Harbor Laboratory Press, Cold Spring Harbor, NY). For 
transformation of S. cerevisiae, for example, the cells are treated wifli Htfaiimi acetate to achieve 
transformation efficiencies of qyproximately 1 0^ colony-forming units (transformed cells)/|jg of 
DNA. Transformed cells are then isolated on selective media appropriate to tiie selectable 
markernsed. Alternatively, or in addition, plates or fdters lifted fixmphtesn^^ 
IDFP fluorescence to idmtify transformed clones. , 

For the introduction oif IDFF-encoding vectors to mammalian cells, the metiiod used will 
depend \jpon the form of the vector. For plasmid vectors, DNA encoding an IDFP may be 
introduced by any of a^numb^ of transfection metiiods, including, for example, lipd-mefliaS^^ 
transfection ("lipofection*^, DEAE-dextran-mediated transfection, electroporation or calcium 
phosphate predpitation. These metiiods are detailed, for example, in Ausubel et aL (eds.), 1993» 
supra. 

Lipofection reagents and metiiods suitable for transient transfection of a wide variety of 
transformed and non-transformed or primary cells are widely available, making lq)ofection an 
attractive method of introducing constracts to eukaryotic, and particularly mammalian cells in 
culture. For raample, LipofectAMINB™ (life Technologies) ot LipoTaxi™(Stratagene) kits are 
available. Other oompaiues offering reagents and methods for Upofectionii^ 
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Laboratories, CLONTBCH, Qlea Research, kVittogpn, JBL Scieatific, MBI Fennentas, 
PanVera, Piomega, Quantum Biotechnologies, SigmapAldrich, and Wako Chemicals USA. 

For fliB introduction of IDFP-cncoding vectors to insect cells, liposome-mediafed 
transfection is commonly used, as is baculovirus infection. Cells such as Schneida:-2 cells 
{Drosophila melanogaster), Sf-9 and Sf-21 cells {Spodopterafrugfperda) or High Five™ cells 
(Trichoplusia ni) maybe transfected usmg any of a number of commercially available liposome 
transfection reagents optfanized for use with insect cells. Reagents include, for example, 
TYansIT-Ihsecta™ (PanVera), FuGENE™-6 (Roche), Insectin™-Flus (IhVitrogpn) and Tft™- 
20 (Piomega). Each of these reagents, used according to the vendor's instructions, pennits flxe 
introduction of nucleic add vectora encoding an IDEP to insect cells. Esqpression vectors 
optimized for insect cell expression are widely known and are commercially available fiom, fiir 
example, Clontech and IhVitiogeii. These mclude both plasmid-based vectors and baculovirus 
vectors, tocct cell CTpr^"^ ^*>At«rQ utr Hasftrilier! in deiail in *^acnlovirus Bcpression 
Vectois" . D.R O'Reilly, L.K. Miller & VA. Luckow (1992, W JL Freeman Co., New York). 

Following transfection with an IDFP-«icoding vector of tiie invention, eukaryotic 
^roferably, but not necessarily^nianiiiiafian) cells successfully incorporating flie construct (intra- 
or extrachromosomally) may be selected, as noted above, by eitiier treatment of the tiansfiected 
population with a selection agent, such as an antibiotic whose resistance gene is encoded by .the 
vector, or by direct screening using, for example, F ACS of the cell population or fluorescence 
scanning of adherent cultures. Frequently, boA types of screening may be used, wherein a 
negative selection is used to enrich for cells takmg up tibe constract and FACS or fluorescence 
scanning is used to further enrich for cdls expressing IDFPs or to idsaatify ^edfic clones of 
cells, respectively. For «cauq)le, a negative selection wifli the neomycin analog G418 (life 
Technologies, Inc.) may be used to idartify cells that have received the vector, and fluorescence 
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scanning may be used to identify those cells or dones of ceils lliat exinress an IDFP to tlie 

gretttest extent 

4. Modificatiim of nucleotide sequences to enhance transladon of IDFPs. 

In many ^jplications it will be advantageous to enhance the egression of fluorescent 
ITOtdns derived fixmi niarine invertdoates or bacteria by modifying the co^^ 
sequences to niake Aem more con^atible wth codcm usage in higjter eukaiyotes, w 
mammals and humans. The mediods for diis so-called • Timnanizing " are known in the art and, as 
noted above, have been ^lied to A. vUaoria GFP and mratants thaoeof (U.S. Patent Nos. 
6,020,192 and 5,874,304). Homanization is accomplished by site-directed mnt^snesis of the 
less fevored codons to more hi^y fivcHed codons for die same ammo add, as described hetem 
or as known m die art The preferred codwis for human gene eoqpresaon are Hstrf in Ta^^^ 
The codons m the table are arranged &om left to right in descending order of relative use m 
faoman genes, hi particular, fliose codons undedmed m die table are ahnost never used m known 
human genes and, if found m a sequence to be humanized, wonld dierefore represent the most 

inqmrtant codons to modify for enhanced expression effidency m mammaUan or human cells. A 
sequence is considered "humamzecT if die oqdqp <»e or more ammo adds has been changed 
fiom die native codon sequence to a codcm sequence more &vored for translation m human or 
mammalian cdls, prrfoably widiout ahermg die polypeptide coding sequence. Site-directed 
mutagenesis is wen Imown m die art and is often performed ndng commerdalfy available ^ 
sudi as die EXSITE™ (Catalog No. 200502), QUIKCHANGE™ (Catalog No. 200518) or 
CHAMELEON® mutagenesis kits (Catalog No. 200S09), available fiom Stnitagene. 
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TABLE 1 

PREFERRED DNA CODONS FOR HUMAN USE 

Q)dcms Predtoed in Homan Genes 
GCCGCTGCAGCO 
TGTTGT 
GACGAT 
GAGGAA 
TTCTTT 

GGCGGGGGAGOT 
CACCAT 
ATCATTATA 
AA6AAA 

Crm TTfi CTT CTATTA 
ATG 

AACAAT 
CCCCCTCCACCG 
CAGCAA 

CGC AGO CGG AGA CXJA OGT 
AGO TCC TCT AGT TCA ICG 
ACCACAACTACG 
GTGGTCQTTOT^ 
TGG 

TAG TAT 

*«w »v preferred for use in human ^es, with honuin 

usage decreasing towards tiie light Underlined codms are ahnost never used in human goies. 

5. Purification of intramolecuhirdimerfiuorescaitiHPOtdns. 

Recombinant fluorescoit proteins can be purified from bacteria as follows. Bacteria 
transfbnned witii a recombinant n>FP-encoding vector of fee invention are grown in Lutiar 
Bertani medium containing the appropriate selective antibiotic (e.g., an^idlKn at 50 pg^ml). If 
fliB vector pemits, reconibinant polypeptide expiBsmon is induced by tiie additi^ 
appropriate inducer (e.g., IPTG at 1 mM). Bacteria are harvested by centrifiigation and lysedby 
fteeze-thaw of the cell pellet Debris is removed by centrifiigation at 14,000 x g, and the 
supernatant is loaded onto a Sephadex G-75 (Pharmacia, Piscataway, NJ) column equilibrated 
with 10 mM phosphate buffered saline, pH 7.0. Fractions containing IDFP are identified by 
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ftuorescence emission at the e3q>eGted waveleaiglii wfam exdtsd by light in the exdtatiaa 
WBvelengtfa. 

If necessary, IDFPs can be isolated fiom eokaryotic cells by meOiods well known to 
those skilled in the art It is also contemplated that IDFPs will include a marker or afBnity tag 
sequence to permit afBnity purification. B7canQ>Ies include 6X-His, ghitathione S transferase 
(GST), or epitope tags such as Flag or tiie Myc tag. These tags are usefid for both bact^ial and 
eukaryotic cell e}qxression and purificaticm of IDFPs. 

6. Candidate modulators. 

A candidate modulator or candidate agrat being evaluated for a modulatory function on 
a given interaction or biological process may be a synthetic conqiound, a mixture of compounds, 
or may be a natural product (e.g. a plant extract or culture supernatant). 

Candidate ag^ts from large libraries of synthetic or natural compounds can be screened. 
Numm)us means are currently used for random and directed synfliesis of saccharide, peptide, 
and nucleic add based confounds. Synthetic compound libraries are commercially available 
fix>m a number of companies including Maybridge Chemical Co. (Trevillet, Comwall, UK), 
Comdex CPrinceton, NJ), Brandon Associates (Merrimack, !NH), and IMQcrj^urce (New 
Milford, CT). A rare chemical library is available fiom Aldrich (Milwaukee, WI). 
Combinatorial libraries are available and can be prepared. Alternatively, libraries of natural 
ftftmpmmila in the form of bacterial, fungal, plant and animal extracts are available fixmi e.g». Pan 
Laboratories (Bothell, WA) or MycoSearch (NC), or are readily produceable by mettiods well 
known in the art Additionally, natural and synthetically produced lifaiBries and confounds are 
readily modified through conventional chemical, physical, and biochemical means. 

Useful candidate compounds may be found wittiin numerous chemical classes. Such 
conq)ounds may be organic compounds, or small organic compounds. Small organic compounds 
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have a molecular weight of more than 50 yet less than about 2,500 Daltons, prefisiably less than 
about 750» mote preferably less than about 350 daltous. Exeioplary classes iuchide heterocycles^ 
peptides, saccharides, steroids, and the like. The con^)ounds may be modified to enhance 
ef5cacy» stability^ phannaceutical compatibiUty, and the like. Stnictural identification of an 
agent may be used to identify^ generate, or screen additional agents. For example, where peptide 
agents are idmtified, fhisy may be modified in a vari^ of ways to enhance their stability, such 
as using an unnatural amino add, such as a D-amino add, particularly D-alanine, by 
fimctionalizing the amino or caifooxylic terminus, e.g. for the amino grot^, acylation or 
allegation, and for die carboxyl groi^, estmfication or amidificafion, or the like. 

Cmdidfttf> agants will be effective at varying concmtrations, depending on tibe nature of 
the agent and on the nature of its interaction with the polypqrtide or polypeptide fiagment of 
interest Therefore, candidate agents should be screened at varying concentrations. Generally, 
concentrations fiom about 10 mM to about IfMaie prefab The association 

constants of ag^ that bmd polypeptides or fiagments thereof will generally be in the range of 
about 1 mM to about 1 £M, and optimally in the range of about 1 to about 1 pM or less. 
Uses of Int rftniftlemlft r Dimer Fhiorescmt Proteins Acco nimfy tn fte hivmtion ^ v •^•^ 

' IDFPscanbeusedinany application for wUch fluorescent protdns are sui For 
example, IDFPs can be used as reporter genes to monitor the activity of promoter sequences, to 
investigate the cellular localization of fiision protems, to maik cellular protems for FACS 
analyses of cell populations, to monitor viral vector infection, to monitor transgene expression in 
VIVO or in culture, and to monitor protein:proteinmteracti^ Itisalso 
e3q)ected that IDFPs comprising fluorescent proteins ^ose spectral charadmstics are sensitive 
to intracellular or extracellular environmental changes (e.g., pH, redox status, phosphorylation of 
the fluorescent protein, etc.) will continue to be sensitive to those changes in the context of an 
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IDFP. Because IDFPs do not heterodimerize, tiiey are particularly weQ suited for multiple- 
labeling studies involving die co-expression of IDFP-fusion proteins witti differing spectral 
characteristics. Techniques useful &r die d^ection of IDFP fiision proteins include, for 
example, standard fluorescent microscopy, confocal microscopy^ flow cytometry and 
fluorescence activated cell sorting (FACS). 

As noted, IDFPs are particularly weU suited to appUcations that rely on FR^ The lack 
of heteiodimerization between IDFPs with differing spectral charact^tics that permit FRET but 
tiiat share fbs same dimerization iaterbc^ is amq'or inqxcovment over previous methods using 
fluorescent protems tiiat could heterodimerize, since it removes a significant source of FRET 
backgroimd In one embodiment two difGrarent IDFPs that have ov^lsqpping CTussion and 
excitation spectm (i.e., they are donor and acceptor to each other) are used to generate fiisions 
with two differ^ cellular (or viral) proteins or protein domains being investigated for their 
ability to interact A q)ecific interaction oftfae fusion partners wiU result in a change in t^ 
detected emission sfpectrum fiom that of die donor to that of the acceptor when a mixture of the 
two IDFP fiision proteins is irradiated with light that excites the donor fluorophore. This type of 
assay is readily ddapbsd to a screemng formal in which known interactors are e^qjosed to 
candidate compounds. Detection of a change fixnin the accq>tor's emission profile to the donor's 
emission profile indicates that a candidate compound has disrupted die int^ction between die 
fiision partners. Eitiierofdiese assays can be performed in vivo or in vitro. An exanq>le of a 
donor/acceptor fluorescent protein pair is P4-3 and S65C or S65T (Table 2; U.S. Pat No.. 
5,981^00). Odier examples of donor/acceptor pairs of fluorescent polypeptides include, but are 
not limited to any of S72A, K79R, Y145F, M1S3A and T203I (excitation X 395 ran, emission X 
511 nm) as donor, and any of S65Q, T203Y (excitation X 514 nm, emission X 527 nm) or 
T203Y/S65Q, V68L or (J69K (excitation X 515 nm, emission X 527 nm) as acceptor (See Tsien 
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et al., wo 97^261). Each of these protrins shares the dimerizatLcm interfece of A. victoria 

GFP. Their e:q>i€Ssion as IDFPswoiildaDowflieir co-expression w 

A pair of fhiorescent proteins fliat are useful according to the invention fimction as 
fluorescent donor and fluorescent acceptor, respectively, in the context of fluorescence resonance 
energy transfix. The ability of a pair of fluorescent proteins to function as fluorescent donor and 
fluorescent acceptor, respectively, in the context of fluorescence resonance energy transfer, is 
determined experimentally and is influenced by a number of &ctors including donor/accq)tor 
peaks, emission/racitation peaks, peak widths, the ef5ci«cy of energy transfer within a 
fhiorescent moi^ and peak overlap. Preferably, 1) the donor excitation peak, A, figure 5, will 
overly minimally with the acceptor excitation peak, C, figure 5, such that excitation of the 
donor does not excite the acceptoi^ 2) the donor excitation peak. A, figure 5, and tbe donor 
emission peak, B, figure 5, have sufBcient overlap to permit efSdent energy transfer; 3) the 
donor emission peak, B, figure 5, and die accq>tor excitation peak, C, figure 5, have suffidotit 
ovo^lap to permit efficient FRET energy transfer, 4) the donor emission peak, B, figure 5 and the 
acceptor emission peak, D, figure 5 have sufficient overlap to allow for differentiation between 
FRET and non-FRET energy transfer; and 5).flie donor excitation peak. A, figure S and the 
acceptor »ussion peak, D, figure 5 have suffident overlap to allow for differentiaticm between 
FRET and non-FRET energy transfer. 

Genotdly, an accqrtable donor/acceptor pair exhibits > 50% quendiing of donor 
emission at a chromophore distance of > 10 A, This is based on the FSrster radius, R^, which is 
the distance at which 50% of exdted donors are deactivated by FRET (i.e., distance at which 
energy transfer is 50% efficient). The value of Ro is d^endrat on the spectral properties of the 
donor and acceptor fluorophores, with a general formula: Ro = [8.8 x 10^ • • n"^ • QYd • 
J(X)]^^ A, where: = dipole orientation fiutor (0-4; = 2/3 for randomly orirated donors and 
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acceptors); Q Yd = the quantum yield of Ibe donor in the absence of the aoceptoi^ n - the 

xefractive index; and J(X) » the spectral overlq> integral (the shared area under the overlapping 

donor onission and acceptor excitation peaks). 

The advantage resulting fiom &e forced intramolecular homodimer formation is most 
apparent when, for exanQ)le» fluorescent protems with different emission djaractensticS) derived 
fixmi the same parent fluorescent protein, are eoqxressed in a single cell. For example, if two 
variants of K reniformis GFP have spectral characteristics that permit FRET between the 
variants, both of these proteins win have Che same dimerization^^ Without the forced 
homodimerizatLon occurring in an IDFP, the background level of acceptor fluorescence upon 
irradiation within the donor's excitation spectrum will be higher than if IDFP versions of die 
same fluorescent proteins are used. 

Even if FRET does not occur between differing fluorescent protenis that share the same 
dimerization interfoce, heterodimerization betwe« two fluorescent fosion proteins via that 
mter&ce can be a problem. For ^campl^ such heterodimerization can reduce the sensitivity of 
sub-celhilar localization studies using two labels. Heterodimerization will segregate the labeled 
proteins into three populations: homodimers of the first fusion protein, homodimers of the 
second fusion protein, and heterodimers conq)rising both. Even if one assumes, strictly for die 
sake of argument, that the hi^erodimerization will not affect the intracellular localization of the 
proteins, heterodimer formation will reduce ttie amount of eitfa^ homodimer available to 
segregate to a given location in the cell. This will result m decreased sensitivity in die assay. 
Therefore, the use of IDFFs in sudi a situation will hxqrove iqnm detection sensitivity ev« if 
one is not relying upon FRET for detection. 

EXAMPLES 

Example 1, Monitoring the interactton of potypeptides using IDFPs. 
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IDFPs are well suited for applications that monitor ttie association of fission polypeptides 

using energy transfer. Jn order to monitor the association of two polypeptides of interest using 
tt)FPs, one must firet select a pair of fluorescent polypeptides fliat are donor and acceptor to each 
otiier. Each polypeptide in the pair must be capable of homodimerization. An example of such a 
pair P4-3 and S65T. Anotiber pair useful according to the mvention is P4-3 and K remfomds 
GFP (hiGFP). The nucleic acid sequMce «icoding the fluorescMce donor po]^q)tide is used to 
generate a construct encoding, in order, a copy of the donor polypq)tLde (e.g., P4-3), a linker, a 
second copy of the fluorescence donor polypeptide and one of the polyp^des of interest 
(alternatively, the sequence encoding flie protein of interest may be placed i^stream of and in 
fiame with the sequ^ices encoding ^IDFP). Similarly, &e sequence encoding the 
fluorescence acceptor polypq)tide is used to genarate a construct encoding, m order, a copy of 
die accq)tor polypeptide (e.g., S65T), a linker, a second copy of the fluorescence acceptor 
polypeptide and the second polypeptide of interest (altmiatively, the seqiraice mcoding the 
protein of interest may be placed upstream of and in fiame with the sdqpiences »codmg the 
acceptor IDFP). An example of a pair of proteins of interest is die Ras proto-oncogene product 
mdflieRaf-1 kinase. The 0-Pn)tein Ras buids to Raf-1 in response to sign^ 
receptor tyrosine kinases. A human o-Ha^RascDNA sequence is available at OCTBank 
Accession No. J00277 , and a human Raf-1 kinase sequence is available at O^ank Accession 
No.NM0Q2g80. As an exanq)le, Ras coding sequences may be ligated in fiame to the donor P4- 
3 IDFP construct, and the Raf-1 coding sequences may be ligated in fimie to the acceptor S65 A 
IDFP construct 

Constrocts encoding the two IDFP-fhsion proteins are transfected, ddier sunultaneously 
or sequentially into cells in which die protein:protein interaction is to be studied (e.g., HeLa 
cells, NIH3T3 cells, or anoflier specific cell type of interest) using methods well known in the art 
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(e.g., lipofecdon, electroporation, calcium pho^hate predpitation, or even letcoviial infecdon 

Mowing geneiation of rec<»nb]nant retiovixal yecbor particles as known in the art). 

After selection of cells incoiporating and expressing ttie constracts by standard mediods^ 
the interaction of the pioteins of interest is measured by detection of fluoresce emission upon 
irradiation with light that excites the donor fluozophore, in this instance P4-3» but not the 
acceptor fluoiophore, S6ST. tiie fused Ras and Raf-1 domains interact, excitation widi 381 nm 
light will result in energy transfer between the P4-3 and S65T fluorophores and Mussion of li^t 
with a maximum at about 51 Innt b contrast, if Aedonoains do not interact, the emission 
mflYrnnim cxcitation at 38 1 nm will be at about 445 nm, the emission maxhnum of P4-3. 
This therefore allows the monitoring of the interaction of the two domains in response to stimuli, 
such as the addition of growth &ctQr, growth fiu^ analogs, or candidate modulators of the 
signal transduction pathway. 

The proteinrprotein interaction assay using IDF? fiision proteins described above may 
also be performed in vitro with isokted or purified IDFPfiisionprot^ This type of assay, or 
even the ceU-based assay described above may be readily adq)ted to a high-throughput format by 
placing the transfected cells or protein samples in a multiwell container and monitoriiig 
fluorescence output of sanq>les exposed to various candidate modulators. Further, by performing 
the interaction assay in the presence or absence of a candidate modulator, one may adapt the 
method for screeoing of candidate modulator conqniunds to identify conqwunds that either 
increase or decrease the measured interaction. A diange in the interaction in ttiepresOTce of a 
candidate modulator relative to the interaction in its absence is indicative of a modulatory effect 
Example 2. LabeUngaeeUwitfaanlDEP. 

IDFPs according to the invention can be used in any application in which fluorescent 
polypeptides are useful. For exanq)le, cells can be labeled by expression of IDFPs to monitor the 
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iq)take and expression of transgene constructs^ including plasmid-based and retroviral constructs. 
Cells nmyako be labded to fedUtatesiAsequeotPAC^ anal Tolabel 
a cell, an IDFP-encoding construct is introduced to cells by standard methods appropriate to that 
cell type. Following introduction, selection for cells receiving fixe construct can either be 
performed by standard positive or negative selection based on additional selectable marker 
sequences (e.g., antibiotic resistance genes), by sorting or selection by FACS, or by allowing 
cells to form colonies and isolating those colonies that fluoresce when irradiated with light 
within the excitation spectrum of the IDFP. Maintaining the cells nnder conditions permitting 
the expression of the IDFP will permit the detection of the cells by fluorescence. 
Example 3. Double label monitoring of protein localization. 

Fhiotescenily labeled proteins are ofien used to examine the scib-cellular localization of 
proteins of interest Frequently, it is useful to monitor the localization of two or more protdns or 
protein domains simultaneously, for example, as a means of identijfying relationships between 
the proteins. When two or more protefais are labeled with fluorescent polypeptides that have flie 
capacity to heterodimerize, the sensitivity of the localization assay can be adversely affected by 
heterodimerization betwem tht). fluorescent polypeptides. 

Examples of protdns to be monitored for localization include protdns that are recruited 
to the vicinity of the plasma membrane upon a stimulus such as growth &ctor engagMient of a 
receptor (e.g., Gh-protems, Protein Kinase A, SH2-domain containing proteins, etc.), proteins that 
localize to the nucleus in response to a stimulus (e.g., steroid hormone receptor), or proteins that 
localize to the golgi, mitochondria, nuclear pores or any other subcellular locale. 

In Older to simultaneously monitor the localization of two proteins of interest, two IDFP 
fusion constructs, each comprising sequences encoding one of the proteins of interest, are 
introduced to cells, either simultaneously or sequentially, using standard methods appropriate for 
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&at cell type. The localization of the DDFP-tagged proteins is monitored by fluoxescttce 

microscopy using excitation wavelengths and filter sets qjpiopriate for the different 
fluorpphores. While not wishing to exchide the possibility, it is generally not necessary that the 
two IDFPs be fluorescent donor and acceptor to each other. More firequently, unless one is 
assaying for direct interaction of the proteins, it is prefenred that the fluorescent proteins are not 
related to each other m this manner. The IDFP fusion protem constructs are made usmg standard 
mefliods well known in the art Exan5)les of pairs of fluorescent polypq)tides that are well 
suited for sfanultaneous monttoring of localization inctudei, but are not limited to any of S72A, 
K79R, Y145F, M153A and T203I (excitation X 395 nm, emission X 511 nm) as donor, and any 
of S65G, T203Y (excitation X 514 nm, emission X 527 nm) or T203Y/S65G, V68L or Q69K 
(excitation X 515 nm, emission X 527 nm) as acceptor (See Tsien ct aL, WO 97/28261). See also 
Table 2. 



46 



wo 02/048174 



PCT/USOl/48690 



TABLE2 



FLUORESCENCE CHARACTERISTICS OF VARIOUS GFP MUTANTS 



Clone 



Mutation (s) max(nm) 



Extinct • 

Excitation ESmission Coefficient Quantum 
max (nm) (M"^cm yield 



Wild type 
P4 

P4-3 
W7 



W2 



S65T 
P4-I 



S65A 
S65C 
S65L 
Y66P 
Y66W 
10c 



WIB 



Emerald 



Sapphire 



None 

Y66H 

Y66H 

Y145F 

Y66W 

N146L 

M153T 

V163A 

N212K 

Y66W 

I123V 

Y145H 

H148R 

M153T 

V163A 

N212K 

S65T 

S65T 

M153A 

K238K 

S65A 

S65C 

S65L 

Y66F 

Y66W 

S65G 

V68L 

V72A 

T203Y 

F64L 

S65T 

Y66W 

N146I 

M153T 

V163A 

N212K 

S65T 

S72A 

N149K 

M153T 

I167T 

S72A 

Y145F 

T203I 



393(475) 

383 

381 

433(453) 



432(453) 



489 

504(396) 



471 
479 
484 
360 
458 
513 



432(453) 



487 



395 



508 21,000 (7,150) 0,77 

447 13,500 0.21 

445 14,000 0.38 

475(501)18,000 0.67 
(17,100) 



480 



10,000 (9,600) 0.72 



511 
514 



504 
507 
510 
442 
480 
527 



39,200 0.68 
14,500 (8,600) 0.53 



476(503) 



508 



511 
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CLAIMS 

1. A recombinant fiision polypeptide conyirising a first polypeptide pq>tide bonded to a 
second polypeptide^ wherein said furst and second polypq)tides are found in nature as monomers 
of a multimeric protein and said first and second polypq>tLde5 are not fluorescent donor and 
acceptor to each othrar, and wherein said recombinant fusion polypqptide is fluorescent when • 
excited 

2. The recombinant fiision polypqrtide of clwrn 1 wherein said first polypeptide and said 
second polypqytide are pq>tide bonded to each other via a linker sequence. 

3. The recombinant fusion polypeptide of claim 2 wherem said linte sequence is fix>m 5 to 
50 amino adds long. 

4. The recombinant fusion polypeptide of claim 1 , further comprising a third polypeptide 
peptide bonded to said recombinant fusion polypeptide. 

5. The recombinant fusion polypeptide of claim 4 wherem said third polypq)tide is a 
member of a specific bindmg pair. 

6. The recombinant fusion polypeptide of claim 4 wherein said third polypeptide is fused to 
the amino t^minus of said first polypeptide. 

7. The recombmant fiision polypqptide of claim 4 wherein said third polypeptide is fused to 
flie caiboxy tmninus of said second polypeptide sequence. 

8. The recombinant fusion polypeptide of claun 1 wherein each of said first and said second 

polypeptide, independently, is a monomer of a multimeric protein selected &om the group 

consisting of: R. reniformis GFP, K mulleri GFP and A victoria GPP. 
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9. The lecombiiiant fusion polypeptide of claim 1 wherein both of said first and second 

polypeptides are monomers of a single multimeric protein selected from the group consisting of 
R. renifonnis GFP, R. mdleri GFP andil. victoria GFP. 

10. A polynucleotide encoding a recombinant fusion polypeptide con^rising a first 
polypq)tide peptide bonded to a second polypeptide, wherein said first and second polypeptides 
are found in nature as monomers of a multimeric protein, and wherein said recombinant fusion 
polypeptide is fluorescent when excited. 

11. the polynucleotide of claim 10 wherein said first polypqrtide and said second 
polypeptide are peptide bonded to each other via a linki^ sequence. 

12. The polynucleotide of claim 1 1 whCTein said linker sequence is from 5 to 50 amino acids 
long. 

13. The polynucleotide of claim 10, wherein said polynucleotide further encodes a third 
polypeptide peptide bonded to said recombinant fusion polypeptide. 

14. The polynucleotide of claim 13 ^erem said tfiird polypeptide is a member^f a^ecific 
binding pair. 

15. The polynucleotide of claim 13 wherein said third polypeptide is fused to the amino 
terminus of said fibcst polypeptide^ 

16. The polynudeolide of claim 13 wherein said third polypeptide is fused to the carboxy 
terminus of said second polypqptide. 
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17. The polynucleotide of claim 10 i«*erem each of said first and said second polypeptide, 
independently, is a monomer of a muWmcric protein selected firam tiie group consisting of it 
rerdfomis GPP.R. mulleriQFP, mdA. victoria GFP, 

18. The polynucleotide of claim 10 wherein both of said first and second polypeptides are 
monomers of a single multimeric protein selected fix)m flie group consisting of A reniformis 
GFP, IL muUeri GFP, A. victoria GFP. 

19. A vector comprising tiie polynucleotide of claim 10. 

20. A cell conoprising the vector of claim 19. 

21. The cell of claim 20, said cell bring a bacterial cell. 

22. The cell of claim 20, said cell bring a eukaryotic cell. 

23. The eukaryotic cell of claim 22, wherein said cell is a yeast cell, an insect cell, or a 
mammalian cell. 

24. ApairofpolypeptidesAonqrismgapolypqitide labels ^■ 
recombinant fusion polypeptide of claim 1 wherem said fkioresccnt dye and said recombinant 
fiision polypeptide are fluorescait donor and accqptor to each other. 

25. A pair of recombinant fusion polypq)tides comprising a first fusion polypeptide as 
claimed in claim 1 and a second fusion polypeptide as claimed in claim 1 wherein said first 
fusion polypeptide and said second fusion polypeptide are fluorescent donor and accq>tor to eadx 
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26. The pair of recombiimt fusion polypq»tide8 of claim 25 v(^^ 

second fimon polypeptides fiirttier comprises a djird polypeptide, and wherein said fliird 
polypeptide of said fiist fusion polypeptide comprises a sequence which is different fiom said 
tiiiid polypeptide of said second fusion polypq>tide. 

27. A mefliod of producing a fluoresceatly labeled recombinant fusion polypeptide, said 
mefliod comprising the steps of introducing a polynucleotide of daim 10 to a cell, and cultiiring 
said cell under conditions tiiat permit die synthesis of said recombinant fiision polypeptide, 
vfbia^ said recombinant fusion polypeptide is produced. 

28. Amethodoflabelingacellwi1hafluorescentreoombinantfiisionpolypq>tide,said 
metfiod conqtrifflng Ibe steps o£ 

(a) introdudng a polynucleotide of claim 10 to a cell; and 

(b) culturing said cell undar conditions fliat pennit the synthesis of said recombinant 
fusion polypeptide, whereby said cell is labeled witix said fluorescent reconibinant fiision 
polyp^de. 

29. The method of claim 26 wherdn, in said introducing step (a), said polynucleotide 
introduced to said ceUfitttherconqirises a sequfflKsemoodmg a Ihhjdp^^ 

to die sequence ^coding said recombinant fiision polypeptide. 

30. A method of monitoring die interaction of two polypq>tide8 of interest said mdhod 
conqnising die 8tq>s o£ 

(a) contacting a first polypq)tide and a second polypeptide wh^in: 

51 



wo 02/048174 PCT/US0iy48690 

(i) said fiist polypeptide is a recombinant fiision polypeptide of claim 4 
wheiein said (hiid polypeptide is a first polypeptide of interest; 

Cii) said second polypeptide comprises a second polypeptide of interest and is 
fhiorescendy labeled; and 

(iii) flie flnorophores comprised by said first and second polypeptides are 
fluorescent donor and fluorescent accq)tor to each other; 

(b) exciting said donor fluorophore; and 

(c) detecting fluorescent emission fiom said fluorescent acceptor, ixrfaerein said 
emission is indicative of the intaaction of said first and said second polypeptides of interest 

3 1. The method of daim 30 wherein said second polypq>tide comprises a fiision polypeptide 
of clahn 5, wherein said fihird polypeptide of said second fiision polypeptide is difBerent firom 
said tiiird polypeptide of said first fiision polypeptide. 

32. The method ofclaim 30 wherein said contacting step is performed in vitro. 
3 3 . The mediod of claim 30 wherein said contacting st^ is performed hi a cell 

34. The mdhod ofclaim 33 wherem said contactmg comprises the step of mtroducmg 
nucleic acid encoding said first and said second polypeptides to a cell. 

35. A method of screening for a compound that modulates the mteraction of a first and a 
second memb«r of a specific bmdmg pair, said method comprising the steps of: 

(a) contacting a first polypeptide and a second polypeptide m the presence and 
absence of a candidate modulator wherein: 
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0 said first polypeptide is a lecombinant fusion potypqrtide of claim 5, 
wherein said m^ber of a specific binding pair is said first memb^ of a specific binding pair; 

(ii) said second polypeptide is fluorescently labeled and comprises said 
second member of a specific binding pair; and 

(iii) the fluorophores comprised by said first and second polypeptides are 
fluorescent donor and acceptor to each olher; 

(b) eKciting said donor fluorophore; and 

(c) d^ecting the jOm^escence of said acceptor fhuHophore, wh^ein emission of the 
spectrum characteristic of said fluorescent accq)tor indicates the interactton of said first and said 
second members of said specific binding pair, and wherein a change in said interaction in Ihe 
presence of said candidate modulator indicates that said candidate modulator modulates tiie 
interaction of the members of said specific binding pair. 

36. The method of claim 35 wherein said second polypeptide is a recombinant fusion 
polypq>tide of claim 5 and said member of a specific binding pair conqprised by said second 
polypeptide is said second member of a q>ecific binding pair. 



53 



wo 02/048174 



1/6 



PCTAJSOl/48690 



1 

1 



o 
u 

I 

Eh 



m 

CI 



vo 
cn 

i 

8 

1 



a 
o 

1 

i 



i 



vo 
vo 



U 

i 

u 



S2 

Li. 



o 

i 

1 

I 



s 

u 

OS 



S 

• 



« 



OS 



SUBSTITUTE SHEET (RULE 26) 



wo 02/048174 



2/6 



PCT/US01/4S690 



§ 

Q 
H 



a 



CO 
O 
Hi 

1 

s 

i 

O 
O 



H 

CQ > 

H S2 

u s 

M 

§ I 

H g 

s I 

i ^ 

Pi CO 
H 

is 

Pi 

!^ g 

H 

piS « 



LL. 





H 




H 


M 


CO 




H 


H 




t* 


















§ 


§ 








• 


• 


* 









SUBSTITUTE SHEET (RULE 26) 



wo 02/048174 



PCT/USOl/48690 



3/6 





QQ 




CO 




d 




U. 



CO 

O 

LL 



I 



I 

I 

o 

as 

O 0> 

I 

O 

p 



99 

o 

e 



g CO go 

gH go 

O O ^« to* 

8 

Eh H Eh > 

g § 

O U 

g w go 

U CD 



O 



to 



o 

U to 
O 

O 

u 



O 



u 

Eh 
O 

o 

U to 

U 

O 

C3 CO 
CJ 

u 

o 
u 
o 



< 
CO 

S2 

LL 



H 



to 



H 
00 

H 



SUBSTITUTE SHEET (RULE 26) 



wo 02/048174 



PCT/US01/48690 



4/6 



8 

K 

p 

o 

r 

u 

S P 

8 

CD 01 

r 



04 



o 

o 
o 



U 

O CO 



u 
o 

CD 
CD 

> 



U 
CJ 

CD O 

CD 

U 

CD O 
CD 
U 



1 



u 

u _^ 
CD 04 
U 
O 



o 



CD 

^> 

CD 
U 

U Qt 
U 

CD CD 
CD 

U 



u 
u 



CD CO 

a 
u 

CD CD 



CD 



CD 
CD 



r 

u 

CD 

O 
U 
H 
H 

H 
VO 

n 



P4 



I: 

CD 

H > 



8 

u 

CO CO 
u cu 

g 

CD 

u 

CD CD 

CD 

L> 

U Eh 

r 



CJ 



u. 

CD U 
Eh 

U 

CD CO 

r 

|. 

CD CD 

8 

CD CO 



u 

CD Oi 
U 

pi JH 

g 
o 

r 

EH > 



3° 

O 

CD O 



g > 



H 

CO 



>Eh"H 

u 

8 

u 
u 

CD 
CD 

CD 

CD CD 



CD CO 
El S 



U 

r 

8 



u 

i 

CD 
CJ 



CD 



CD 

CD 
U 

U 

CD CD 

CD 

U 

CD O 
CD 
U 

3 
o 

CD 

u 

CD Oi 
O 



g 

o 

VD 



o 

CD CO 
g 

O CD 
CD 

a 
u 

u 



o 

CD CO 

g 

g" 

8 

u 14: 

CD 

VO 
VO 



SUBSTITUTE SHEET (RULE 26) 



wo 02/048174 



PCT/nSOl/48690 




SUBSTITUTE SHEET (RULE 26) 



wo 02/D48174 



6/6 



P€TAIS01/48690 



EMIT 




WAVELENGTH- '■ ► 



A = DONOR EXCITATION PEAK 
B = DONOR EMISSION 
C = ACCEPTOR EXCITATION 
D - ACCEPTOR EMISSION 

FIG. 5 
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